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Abstract. Consider a Markov chain on the space of rooted real binary trees 
that randomly removes leaves and reinserts them on a random edge and suit- 
ably rescales the lengths of edges. This chain was introduced by David Aldous 
who conjectured a diffusion limit of this chain, as the size of the tree grows, 
on the space of continuum trees. We prove the existence of a process on con- 
tinuum trees, which via a random time change, displays properties one would 
expect from the conjectured Aldous diffusion. The existence of our process is 
proved by considering an explicit scaled limit of a Poissonized version of the 
Aldous Markov chain running on finite trees. The analysis involves taking limit 
of a sequence of splitting trees whose age processes converge but the contour 
process does not. Several formulas about the limiting process are derived. 
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1. Introduction 



A Cladogram is an unrooted semi-labeled binary tree. That is to say, every 
vertex that is not of degree one must have degree three. The vertices of degree one 
are called leaves which are labeled by {1, 2, . . . , n}, while every other vertex will be 
called internal vertices. It follows that if such a tree has exactly n leaves, it must 
have 2n — 3 edges. In [H |5] Aldous considers the following Markov chain on the 
space of n-leaf Cladograms. For a precise description we first define two operations 
on Cladograms closely following [5 , to which we refer the readers for more details. 

(i) To remove a leaf i. The leaf i is attached by an edge ei to a branchpoint 
b where two other edges 62 and 63 are incident. See Figure [l] Delete edge 
ei and branchpoint 6, and then merging the two remaining edges 62 and 63 
into a single edge e. The resulting tree has 2n — 5 edges. 

(ii) To add a leaf to an edge /. Create a branchpoint b' which splits the edge 
/ into two edges /2,/3 and attach the leaf i to branchpoint b' via a new 
edge /i. This restores the number of leaves and edges to the tree. 

Let Tn denote the finite collection of all n-leaf Cladograms. Let T, T'' be elements 
on Tn. Write T'' ~ T if T'' 7^ T and T'' can be obtained from T by following the 
two operations above for some choice of i and /. Thus a valued chain can be 
described by saying: remove leaf i uniformly at random, then pick edge / at random 
and re- attach i to f. In particular its transition matrix is 



This leads to a symmetric, aperiodic, and irreducible finite state space Markov 
chain. Schweinsberg [27 proved that the relaxation time for this chain is O(n^), 
improving a previous result in [5^ . We will refer to this Markov chain as the Aldous 
Markov chain in the following text. 

The problem of interest for us in this article is to establish a limiting diffusion 
for this sequence of Markov chains as n tends to infinity. It is not entirely rigorous 
at this point what we mean by a limit here. For example, what is our limiting 
state space? Detailed answers of such questions will be provided later. Roughly, 
our state space will be the space of continuum trees^ i.e., compact metric spaces 
without loops which support a probability measure on them. Hence we are looking 
for a continuous Markov process on the space of continuum trees that can be argued 
as a weak limit of the above sequence of chains. 

There are some technical problems of working with an unrooted Cladogram. 
Hence, we will make a small change in the above model and consider Cladograms 
in which a distinguished vertex in the tree at time zero is marked as root which is 
never removed and continues as the root of all subsequent trees. Instead of calling 
Cladograms, we will simply call such trees as rooted binary trees. Please see Section 
[2] to find formal definitions of the kind of trees we work with. 

When trees are rooted, it is natural to associate a family structure with it where 
the root is an ancestor, and every vertex gives rise to a number of children / progeny. 
We will frequently use this analogy whose meaning will be obvious from the context. 

1.1. Poissonization. We now introduce the important concept of Poissonization 
of the Markov chain described above. One challenging aspect of the Aldous Markov 
chain on Cladograms is that at every step of the chain the number of leaves remain 




if = 



T. 



Figure 1. Markov chain on Cladogram (reproduced from [5^) 



the same. Things become simpler if we adopt the Poissonized version of the Markov 
chain: every leaf has an exponential clock with rate 2 attached to it which deter- 
mines the time of their removal (deaths). Every edge also has an exponential clock 
of rate 1 at which point a new pair of vertices (an internal point and a leaf con- 
nected by a edge) is introduced on that edge. The rate of births and deaths are 
chosen to accommodate for the fact that there are about twice as many edges as 
leaves. We will refer to this model as the Poissonized Aldous Markov chain. 

This Poissonization changes the Markov chain when the number of leaves are 
finite. However, there are strong reasons to believe that the limiting diffusion for 
the non-Poissonized chain is equal in law the limiting diffusion of the Poissonized 
chain via a stochastic change of time. We provide an argument below to make our 
case. 

Going back to the Aldous Markov chain (unrooted), consider a branch point 
b in the tree T. It divides the collection of leaves naturally into three sets. Let 
X{T) = (Xi, X2, X3)(T) denote the vector of proportion of leaves in each set. It 
can be verified (see Pal [23]) that the Markov chain on Cladograms induce a Markov 
chain on these vector of proportions, which, when run at n^/2 speed, converges in 
law to a diffusion on the state space {(xi, X2, X3), > 0, xi + X2 + X3 = 1}. The 
law of this diffusion is called the Negative- Wright-Fisher or NWF(l/2, 1/2, 1/2). 

In Pal [23 it is shown that the NWF law can be obtained by considering ratios 
of squared Bessel processes of negative dimensions. A comprehensive treatment of 
BESQ processes can be found in the book by Revuz & Yor [26]. This family of one 
dimensional diffusions is indexed by a single real parameter (called the dimension) 
and are solutions of the stochastic differential equations 



where /3 is a one dimensional standard Brownian motion. We denote the law of 
this process by Q^. It can be shown that the above SDE admits a unique strong 
solution until it hits the origin. The classical model only admits paramater to 



(1) 




x>0, t>0, 
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be non- negative. However, an extension, introduced by Going- Jaeschke & Yor [15], 
allows the parameter to be negative. 

It is well known that BESQ process with a nonnegative dimension 9 is the diffu- 
sion limit, as the number of initial particles go to infinity, of a critical (discrete or 
continuous time) Galton- Watson tree with a rate of immigration 0. For example, 
this happens for a continuous time, binary Galton- Watson, every individual dies or 
reproduces at rate 2 and there is an infiux of new members at rate 0. It follows 
from the same proof that when is negative, immigration changes to emigration, 
i.e., at rate ^, members leave the family until there is no one surviving. 

The following construction of NWF processes can be found in [23 , Theorem 8]. 

Lemma 1. Let Z = (Zi, Z2, Z3) 6e a vector of 3 iid BESQ processes of dimension 
— 1. Let ( be the sum Yl^=i ^i- Define 

Ti = inf {t > : Zi{t) = 0} , r = Af^^T^. 

Then there is a A/'WF(l/2, 1/2, 1/2) process (/ii,/i27M3) for which the following 
equality holds: 

(2) /i,(4a) = -^(tAr), l<z<3, Ct = 

A very similar result holds when instead of considering one branch point we 
consider several branch points and look into the partition of leaf masses generated 
by these branch points. The vector of proportions admits a diffusion limit which 
can obtained via a random time change from independent BESQ processes (of 
possibly both positive and negative dimensions). Hence it is natural to search for 
a Markov chain on Cladograms such that leaf masses in partitions generated by 
branch points converge to BESQ processes. This is exactly what happens for the 
Poissonized Markov chain since the number of leaves in each partition evolves as 
an independent Galton- Watson branching process with immigration or emigration. 

Hence, Theorem [T] gives us the important clue as to the idea of poissonization of 
the Aldous' chain and the correct value of the rate of emigration which is crucial 
in the following. More examples of Poissonization can be found in Markov chains 
like the Wright-Fisher chain whose Poissonized version is a branching process with 
immigration. The diffusion limit of the former is the Wright-Fisher diffusion and 
the diffusion limit of the latter is the BESQ process. The time-change relationship 
between the two can be found in Pal p^. More reminiscent is the Fleming- Viot 
model where the Superprocess is the Poissonized version, and the theorem that 
allows us to obtain the Fleming- Viot from the Superprocess by a time and mass 
change is called the Perkins Disintegration Theorem (see Etheridge [11^). 

We end this discussion with an obvious lemma. 

Lemma 2. Consider an internal vertex on a finite tree on which the Poissonized 
Aldous chain is running. This internal vertex divides the leaves of the tree in three 
subtrees rooted at that vertex. One of them contains the original root. Consider 
the number of leaves in any of the other two parts. The process of total number of 
leaves in these subtrees follow independent continuous time binary Calton- Watson 
branching process with a birth and death rate of two and an emigration rate 1 until 
one of them hits zero. 
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Figure 2. Coding of a tree 

1.2. Coding trees by age. A fundamental concept at the heart of our argument 
is the so-called age of a random tree. Consider any model of generating a random 
tree starting with one individual. If after a units of time later the tree survives, 
then the age of the tree is a. This is particularly useful when we have one time-axis 
and several trees start sprouting at different points in time. At any given instance 
of time, we consider the set of trees surviving at that time. Their ages are given by 
the durations since they have sprouted. 

In this dynamic situation properties of trees can be described by its age. For ex- 
ample, suppose that the total existing population in a dynamically growing random 
tree is given by a continuous times Galton- Watson branching process. Then, given 
its age, one can deduce its size from the conditional distributions of Galton- Wat son 
branching process conditioned to survive up to time a. 

Now, one can actually code an entire tree by the ages of its various subtrees. We 
will discuss these ideas in much more detail later in the text. However, to give an 
idea of our strategy, we provide a simplified informal description below. 



d4_q4 



Consider the image of a finite tree in Figure |2] The variables {di, . . . , (ir} will 
be considered (positive) distances and the variables {qi^ . . . , q^} will be considered 
as (nonnegative) masses. Now we construct a stochastic process (741,^2,^3) as 
follows: Ai takes three values di < d2 < dj with probabilities proportional to 
gi,g2, and qj respectively. Given Ai the value of A2 is determined. 
Case (i) Suppose Ai = di, we imagine our process enters the first subtree which is 
marked with dashes. Then, X2 takes two values ds < d^ with probability 
proportional to qs and q^ respectively. 
Case (ii) Suppose Ai = d2^ then our process enters the dotted subtree, and A2 is d^ 

with probability one, irrespective of qQ. 
Case (iii) Suppose Ai = dj^ then A2 = with probability one. 

The reader can now guess the distribution of ^3, given the values of Ai^A2. In 
fact As = d^ with probability one if {Ai = di, A2 = 6^3}, or zero otherwise. 
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The importance of this stochastic process is that suppose the tree is absent to 
start with, and we are given the distances and masses for the stochastic process 
only. It is easy in a natural way (to be formalized later) to construct the tree out 
of the set of possible paths of this stochastic process. 

Somewhat more formally, one can construct a set representation of the tree 
(according to the definition in Aldous [3 ) from the support of the paths of the 
stochastic process. Note that, the measure representation of that set will not be 
given by the law of the stochastic process. By that we mean the distinct paths of 
the stochastic process are not equiprobable. Although it is not crucial in what we 
do, some discussion of this phenomenon needs to be done. Suppose we pick a leaf 
at random from the tree given in Figure [2] and consider the sequence of d^'s that 
appear ordered away from the root. Then, this provides another law of (Ai, A2, A3) 
which pick subtrees according to their leaf mass. However, one can easily choose 
qi^ such that a subtree with a small qi has a large number of leaves. 

In this article, as the tree evolves, new subtrees will be born and old subtrees 
will die. The masses g^'s will be given by the ages of the subtrees. Note that if the 
sizes of the subtrees grow according to a Galton- Watson law, it is quite possible 
that an older subtree has much less leaves than a younger one. 

Nevertheless we have made our case that a stochastic process whose coordinates 
are discrete random variables can code a tree in such a way that probability masses 
of entering a subtree are given by the age of the subtree. The leaf masses of the 
subtree can be recovered from their ages, and hence the two measures (given by 
age and leaf-mass) will be absolutely continuous with respect to one another. In 
particular, they will have the same support representing the tree itself. 

1.3. The content in this paper. The title of this article refiects the fact that 
several loose ends in this article will be settled in a follow-up article. A precise 
summary of what has been achieved in this paper and what will be done in the 
follow-up is presented below. 

What has been done in this article: 

(i) We study the Poissonized version of the Aldous chain on Cladograms and 
consider its natural limit on the space of real trees. 

(ii) We code trees by a stochastic process whose law is random. This is a 
different way of looking at the set representations due to Aldous. 

(iii) Consider the (random) law of the first coordinate of the said stochastic 
process. As the Poissonized chain runs, it induces a Markov chain on point 
processes on x R+ which ultimately hits the empty point process with 
probability one. We establish finite-dimensional convergence of an excur- 
sion of this point process to a limiting process of point processes as time 
(of the Markov chain), distance (on the tree), and mass (leaf-masses on 
the trees) are properly scaled. This limiting point of point process is very 
explicit. In fact, the marginal distribution and the transition mechanism 
are all obtained as explicit formulas. 

(iv) By using recursion one can similarly argue the (finite-dimensional) con- 
vergence of an excursion of the induced Markov chain on the joint law of 
finitely many coordinates of the stochastic process to a limiting law. 

(v) Now, we turn to constructing random trees coded according to these limit- 
ing laws at various times. We consider two time points and produce a joint 
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Figure 3. Excursion of a reflected Levy process 

coupling of random trees at these two time points whose corresponding sto- 
chastic processes are obtained via the limiting transition mechanism. An 
easy induction produces coupling at finitely many time points. As before, 
the formulas are all explicit. 

What we plan to do in a follow-up article: 

(i) Prove that our couplings produce a Markov process on real trees and that 
one can choose it to have continuous paths in the Gromov-Hausdorff dis- 
tance. 

(ii) That the Brownian CRT is an invariant distribution for the above Markov 
process. 



1.4. Outline of the strategy. Our strategy is the following. Consider again the 
tree in Figure [2] Assume that each edge is of unit length. One can think of the 
horizontal line from the root as a spine on which two subtrees are supported. As 
the Poissonized Aldous Markov chain runs, new subtrees appear on the spine and 
old subtrees disappear. The distances of these subtrees from the root is given by 
the index in which they appear away from the root. Thus at any instance of time 
one obtains a point process in x R+ recording the the distance d and the age q 
of the subtrees ordered away from the root. Hence the Poissonized chain induces a 
Markov process on point processes on x R+. If we find the limiting transition 
probabilities of this Markov process, this gives us how the law of Ai changes with 
time. The process of laws of A2, A3, and so on are given by the same transition 
mechanism due to the obvious independence of leaf growth and deaths in various 
subparts of the tree. 

Hence our central problem reduces to identifying the Markov process of point 
process of ages as determined by the Poissonized Aldous chain. This process has 
the following description in terms of a compound Poisson process with drift. Let 
A/' be a Poisson process with rate one and let (1X2^ - • - be an iid sample taken from 
the distribution function L{u) = !-(! + 2ii)~^/^, u>0. 
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Figure 4. Excursion of the age process 
Define the compound Poisson process with drift 

Xt = -t^Y.C,, t>o, 

i=l 

starting at zero until it hits zero. Let / denote the process of running infimum. 
That is 

It = inf{Xs, 0<s<t}, t>0. 
Consider the process X — I which is X reflected at its infimum. Then this reflected 
process can be seen as iid sequence of excursions above the origin. 

A typical image of such an excursion is shown in Figure [3j For every nonnegative 
level a on the ?/-axis, consider the sequence of jumps across level a. These are 
marked by dotted lines. Each of these jumps have an undershoot below level a, 
i.e., difference between a and where the process was right before it jumps across a. 
The sizes of these jumps ordered by the indices in which they appear form a point 
process in x R+. A moments reflection will show the reader that as we increase 
a, starting from 0, one obtains a Markov process on point process in x R+. 

Now, under proper scaling, the jump distribution of this Levy process does not 
converge to any nontrivial rate function. However, the renewal measure does, and 
hence the point process of undershoots indexed by their order of appearance has a 
certain limit. Notice that the distance on the x-axis plays no role in this analysis. 

Our limiting point process on ages of subtrees is very closely related to this limit 
in the following way. Consider Figure [4] We have here two independent instances 
Xq and Xi of excursions of the above reflected Levy process. Now, we interweave 
the paths of Xq and Xi in the following manner. Start with the first jump of Xq, 
then concatenate the path of Xi until it exceeds the jump of Xq. Then Xi has 
reached a new height. Concatenate the remaining path of Xq from where we left 
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until it exceeds the height of Xi. Alternate between the two processes until one of 
them hits zero and we stop. 

As before, at every nonnegative level, one can consider the point process given 
by undershoots below that level. Except now we reverse the order in which these 
undershoots appear. That is to say, the farthest undershoot from the ?/-axis will be 
labeled one, and the closest will have the maximum label. This family of point pro- 
cesses, indexed by level, provides the transition probabilities of the ages of subtrees 
on a spine in the Poissonized Aldous Markov chain on finite trees. When we take 
a scaling limit, we obtain an excursion measure on paths from (0, oo) to the space 
of point processes on x R+. This excursion measure completely determines 
the dynamics of the limiting age process and hence the tree itself by the previous 
argument. 

In a nutshell our construction of the limiting process is the following: given 
any two time points ao < ai, we construct a joint set-representation of a pair 
of continuum trees at those times whose transition mechanism is shown to be a 
scaling limit of the transition mechanism of the finite Poissonized Aldous chain. 
This set representation is achieved by a joint coupling of an iid sequence of random 
sequences at times ao and ai that produces consistent and leaf-tight families of 
proper /c-trees, and hence should determine the limiting continuum trees by taking 
a closure in Unfortunately, at this point we cannot prove that the sequences 
remain in although it is intuitively obvious from the calculations. 

The reader should note that there are three kinds of scaling parameters involved. 
One, is the natural distance on the finite tree which converges to the Lebesgue 
distance on the continuum limit. Two, is the density of leaves, that also converges 
to a probability measure on the continuum trees. Three, is the factor by which 
the Markov chain has to be speeded up to reach a diffusion limit. The relationship 
between the three is the following: when the speed is of order n, the distance 
should be of the order ^/n^ and the leaf-mass should be scaled by n. The fact 
that this is the only possible scaling can be verified by our discussion of NWF and 
BESQ processes. However, since we are not rescaling total leaf-mass to one, our 
continuum trees have leaf-mass measures that are finite positive measures. In fact, 
under the Poissonized diffusion, the total mass process evolves as a BESQ process 
of dimension —1. In particular, the process of trees ultimately dies. Note that, this 
total mass process is required in the rescaling the triplet (time, distance, mass) to 
obtain the Aldous diffusion from the limiting Poissonized diffusion. The stochastic 
change of time is given in ([2|, where ( is the total mass process. Furthermore, 
when we rescale the leaf-mass measure to one, one needs to rescale the Lebesgue 
distance on the tree by the square-root of the same factor. 

The of scaling-limit approach used in this paper is very different more the Dirich- 
let form technology used in the study of jump Markov processes on continuum trees. 
For example, the process inspired by the subtree prune and regraft (SPR) as stud- 
ied in the seminal article by Evans and Winter [T4|. In an SPR move, a binary tree 
T is cut "in the middle of an edge" to give rise to two subtrees, say and 
Another edge is chosen in T', a new vertex is created "in the middle" of that edge 
and the cut edge in T" is attached to this new vertex. Finally, the "pendant" cut 
edge in T' is removed along with the vertex it was attached to in order to produce 
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a new binary tree that has the same number of vertices as T. Please see \JA^ for a 
figure. 

This produces a macro-level change in the continuum tree and hence is reflected 
by a pure-jump Markov process on the space of real trees. 

The authors in [14] use Dirichlet form technique to prove the existence of a 
process on the space of continuum trees that mirrors the behavior of this discrete 
chain. One problem with that approach is that there is no convergence theorem 
from the Markov chain to the limiting process. 

2. Preliminaries on trees 

2.1. The space of discrete trees. We start with some notations. Let N denote 
the set of positive integers. Locally flnite rooted trees are usually labeled and 
ordered by a flnite sequence of integers as follows. The root of the tree is 0. The 
jth child of u = (iii,ii2, • • • ^ N^, is uj, where for two elements v G and 
w , the element vw belongs to N^^-^ and is the concatenation of the component 
sequences. Let l^il = n be the generation, or genealogical height, of u. Let 

CO 

(3) ^ •= U ^^^^^ = 

Formally a discrete tree is a subset T such that: 

(i) e r. 

(ii) li V = uj and j G N, then u eT. 

(iii) For every u G there is a nonnegative integer Ku < oo (the number of 
offsprings) such that uj G T if and only if j G {1,2,..., K^}. 

We write u ^ v if u is din ancestor of i.e., there is a sequence w such that 
V — uw. We denote hy u Av the most recent common ancestor of u and v. 

2.2. Real trees. The formalism for real trees is somewhat intricate and lengthy. 
We do not deal with all the details in this article and refer the reader to the excellent 
survey by Le Gall [20 from which we borrow the following description. Also see the 
articles by Evans and Winter [14 , Evans-Pitman- Winter [l7, and the preprint by 
Depperchmidt et al. [9 where extensive formalism for dealing with Markov chains 
on real trees have been deflned. We only consider compact real trees which can be 
deflned as follows. 

Definition 1. A compact metric space (T, d) is a real tree is the following two 
properties hold for every a, 6 G T. 

(i) There is a unique isometric map fa^b from [0, d(a, b)] into T such that 
fa,b{0) = a and fa,b{d{a,b)) = b. 

(ii) If is a continuous injective map from [0, 1] into such that q{0) = a and 
q{l) = 6, we have 

q{[0,l]) = fa,b{[0,d{a,b)]). 

A rooted real tree is a real tree (T, d) with a distinguished vertex p = p{%) called 
the root. 

Let us consider a rooted real tree (T, d). The range of the mapping in (i) 
is denoted by [[a, 6]] (this is the line segment between a and b in the tree). In 
particular, [[p, a]] is the path going from the root to a, which we will interpret as 
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the ancestral line of vertex a. More precisely we define a partial order on the tree 
by setting a ^ 6 (a is an ancestor of b) if and only if a G [[p, b]]. If a, 6 G there is a 
unique c G T such that [[p, a]]n[[p, 6]] = [[p, c]]. We write c = aAb and call c the most 
recent common ancestor to a and b. By definition, the multiplicity of a vertex a G T 
is the number of connected components of %\{a}. Vertices of ^\{p} which have 
multiplicity one are called leaves. Our goal is to study the convergence of random 
real trees. To this end, it is of course necessary to have a notion of distance between 
two real trees. We will use the Gromov-Hausdorff distance between compact metric 
spaces, which has been introduced by Gromov (see e.g. [16 ) in view of geometric 
applications. 

If {E, S) is a metric space, we use the notation SnausiK^ K') for the usual Haus- 
dorff metric between compact subsets of E: 

8Haus{K,K') = inf {e > : C U,{K') and K' C U,{K)} , 

where Ue{K) := {x ^ E : 5{x^ K) < e}. Then, if % and V are two rooted compact 
metric spaces, with respective roots p and p' ^ we define the distance dcHi^^^') by 

dGH{% 2:') = inf {SHaus (^(1), ^'(^0) V S{4>{p), <P'{P'))} 

where the infimum is over all choices of a metric space {E^ S) and all isometric 
embeddings (j) : T — > E and (j)' : T' — > E of % and %' into (E^S). Two rooted 
compact metric spaces Ti and %2 are called equivalent if there is a root-preserving 
isometry that maps Ti onto %2' Obviously dcHi^i^') only depends on the equiv- 
alence classes of % and V . Then dcH defines a metric on the set of all equivalent 
classes of rooted compact metric spaces (see [16 , 20 , 13 ). We denote by ^HTree the 
set of all (equivalence classes of) rooted real trees. 

Theorem 3 (See [13]). The metric space (IHTtee, ^Gi/) ^-^ complete and separable. 

Definition 2. Weighted real trees are compact real trees equipped with a prob- 
ability measure on its Borel <j-field (informally, the leaf distribution). Let 9^Tree* 
denote the set of all weighted real trees. Continuum trees are weighted real trees 
such that the support of the leaf distribution is the entire tree. 

We lift the following definitions and results from [14 . Recall the Prokhorov 
metric between two probability measures p and on a metric space (X, d) with the 
corresponding collection of closed sets denoted by 

dp{p, u) := inf {e > : p{C) < v {C) + e for ah C G CT} , 

where C := {x e X : d{x, C) < e}. 

An e- (distorted) isometry between two metric spaces (X^dx) and (F, (iy) is a 
(possibly nonmeasurable) map / : X — > Y such that 

dis(/) := sup{\dx{xi,X2) - (/(xi), /(x2))| ]Xi,X2 e X} < e. 

The following lemma is important in its own right. 

Lemma 4 ([T4j, Lemma 2.1). Let (Ti,di) and {%2^<^2) be two compact real trees 
such that dcHii^i^^i)^ {^2^^2)) < ^ for some e > 0. Then, there exists a measur- 
able 3e-isometry from %i to %2- 

We now define the weighted Gromov-Hausdorff distance between two metric 
spaces with probability measures (X^dx^i^x) and (F, (iy,z/y). For e > 0, set 

Fx Y '•— {measurable e-isometries from X to Y} . 
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Define 

A^^ := inf {e > : 3 / G Fx^y^9 ^ ^y,x such that 

dp {vx o vy) < e, dp {yx, i^y o g~^) < e] . 

A* satisfies all the properties of a metric except the triangle inequality. To rectify 
this, we define 

{n-l 

where the infimum is taken over all finite sequences of compact, weighted real trees 
Zi, Z2, . . . , with Zi = X and Zn = Y. 

Lemma 5 ([14], Lemma 2.3). The function dQ^ is a metric on weighted real trees. 
Moreover, 

^A*aH{X,Yy/' < d*aH{X,Y) < A^h {X,Yf' . 

Theorem 6 ([14], Theorem 2.5). The metric space (9^Ttee*, d^^:^) is complete and 
separable. 

The above theorem shows that the space of continuum trees is suitable for doing 
probability. 

2.3. Chronological trees. Chronological trees are particular instances of real 
trees. We closely follow the notations and description provided in the article by 
Lambert [19 . Also see the book by Jagers [18, Chapter 6] where they are treated 
as special cases of a general branching process. Informally, this a model of a pop- 
ulation where each individual exists during her lifespan^ and produces offsprings 
during the time of her existence. Splitting trees are random chronological trees 
where we are going to assume that lifetime of individuals are iid from some suitable 
law, and offsprings are produced at a constant rate. 

Somewhat more formally, such trees can be considered as the set of edges of some 
discrete tree embedded in the plane, where each edge length is a lifespan. Each 
individual of the underlying discrete tree possesses a birth level a and a death level 
uj, both nonnegative real numbers such that a < a;, and a nonnegative number of 
offsprings with distinct birth times belonging to the interval (a^uj). We think of a 
chronological tree as the set of all so-called existence points of individuals of the 
discrete tree. See Figure 2, reproduced from [T9] . 

More rigorously, consider the space of discrete trees in (|3| and let 

(4) U = ^ X [0, 00) 

and set p = (0, 0) G U. We let Pi and P2 respectively to be the canonical projections 
of U on Z// and [0, 00). The first projection of any subset T C U will be denoted by 

r, 

r := Pi {T) = {ueT : 3(t > 0, {u, a) G T} . 
A chronological tree T is a subset of U such that: 

(i) p G T and is called the root. 

(ii) T is a discrete tree. 

(iii) For any u e there are < a{u) < u{u) < 00 such that {u, a) G T if and 
only if cr G {a{u)^uj{u)]. 

(iv) For any u and j G N such that uj G T, Oi{uj) G {a{u)^uj{u)). 
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Figure 5. A chronological tree (reproduced from [191 P- 353]) 

(v) For any u G T and z, j G N such that m, uj G T, 

j ^ a{ui) ^ a{uj). 

For any u G a{u) is the birth level of uj{u) is the death level and we denote 
by ({u) its lifespan uj{u) — a{u). 

Note that, condition (iii) above implies that if T is not reduced to p, then the 
root of T has a positive lifespan C(0)- 

We now discuss planar embedding of a chronological tree. This is merely formal- 
izing the way we have drawn Figure 2. The most important rule is: edges always 
grow to the right. For any x G T, we denote by 6{x) the descendants of x, that is, 
the subset of T containing all z G T such that x z. The descendants of x can be 
split int its I (eft) -descendants Oi{x) and r(ight)- descendants Or{x). Their definitions 
are as follows: if x is not a branch point, Oi{x) = 9{x) and 9r{x) is the empty set 
0; if X = (ix, cr) is a branch point, then a = a{uj) for some integer < j < K^^ and 

'= U ^ + ^) ^r{x) := {x} U y 0{uj, a + e). 

e>0 e>0 

Note that we have adopted the convention of putting x itself as its own r-descendant. 

Finally we need to define a proper notion of depth-first ordering for chronological 
trees. We define the following linear order following ^3 p. 356]. 

Linear order '^<'\ Let x,y G T. Unless x Ay G {x,?/}, either y G Or{x A y) or 
X G Or{x Ay). Define a total order by 

X < y <=^ y x^ or X G Oi{x A y) <^ y x^ or y G Or {x A y). 

Note that the genealogical order and the linear order are reversed. 

There is another, more classical, way of describing a chronological tree. This is 
called the age process. The following description is for a finite chronological tree. 
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Definition 3. (The Age process) For every level cr G [0, oo) consider the collection 
of existence points {(ii^,cr), < i < mcr} ^ T, where rricr is some nonnegative 
integer depending on a. We assume that Ui^ are ordered in the linear order < 
described above. This results in a point process 

Age(cr) = {{k, (7 - a{uk)) , /c = 0, 1, . . . , m} . 

In words, Age(<j) keeps track of all the existence points at level a and their current 
ages. The process {Age(cr), a > 0} is called the age-process. 

A splitting tree is a random chronological tree characterized by a a-finite measure 
A on (0, oo] called the lifespan measure, satisfying 

I (r A l)A(dr) < oo. 
io 

Given a lifespan measure, let denote the law of a splitting tree starting with 
one ancestor individual having deterministic lifetime (0, x], where x is only allowed 
to be oo when A({+oo}) > 0. This, informally, means that for each individual v 
of the tree, conditional on a{v) and uj{v)^ the pairs {a{vi)^C,{vi))iyi made of the 
birth levels and lifespans of v's offspring are the atoms of a Poisson measure on 
{a{y)^ix){v)) X (0, oo] with intensity measure LEB A. Here LEB refers to the 
Lebesgue measure. In addition, conditionally on this Poisson measure, descending 
subtrees issued from these offsprings are independent. 

This can also be expressed in terms of grafting (please see |^ for definitions). 
Suppose ^(r,T, x,i) is the tree obtained by grafting on T at x, as descending 
from Pi(x)i. Let (a^, Ci)i>i be the atoms of a Poisson measure on (0, x) x (0, oo] with 
intensity measure LEB A. Then V. is the unique family of probability measures 
on chronological trees satisfying 

t'= y <?(T„,Ox(0,x),(0,a„),n), 

n>l 

where conditionally on the Poisson measure, the T^'s are independent splitting 
trees with law 7^^^ , and T has law V(i . 
Finally, set 

m := rA((ir), 
Jo 

and we say that T is subcritical, critical, or supercritical, according to whether 
m < 1, = 1, or > 1. 

We now define (as in ^9\) the jumping chronological contour process (JCCP) for 
a finite splitting tree. Informally, when the tree is finite, it is the distance from 
the root of a path on the chronological tree that starts from the root and traverses 
the tree in a depth first order in the following manner: at any branchpoint, the 
path jumps up to the end the lifespan of the individual born at that branchpoint 
and then decreases linearly (at speed —1) along the lifetime of this individual until 
it encounters a new birth event. When the contour process ends its visit of an 
individual, its value is thus the birth level of this individual. It then continues to 
visit its mother's lifetime at the level where (when) it had been interrupted. The 
basic theorem we require is that the JCCP for a splitting process is a Levy process 
with a Levy measure A. This follows since the jumps are iid and appears at rate 
one, according to our construction of the splitting tree. A figure of a JCCP for a 
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Figure 6. A typical JCCP (reproduced from [19, p. 370]) 

finite chronological tree is shown in Figure 3. For more pictures and details, please 
see JW . 

We now present the formal definition closely following [19 . We start with the 
exploration process. From now on, T will denote a finite chronological tree with 
finite total length i := A(T). The real interval [0,£] is equipped with its Borel 
(7-algebra and the Lebesgue measure. For any x G T, set 

S{x) :={y eJ : y <x}. 

It has been shown in [19 that the map (p :T ^ [0,£] given by 

(p{x) := A(§(x)), X eT 

is the unique measurable bijection that preserves the Lebesgue measure on both 
metric spaces and the order. Note that 

(p{0,uj{0)) = 0, and (p{p) = i. 

Definition 4. The right-continuous inverse {(p~^{t); t G [0,£]) is called the explo- 
ration process. The process given by the second projection, i.e., 

is called the JCCP, or, the jumping chronological contour process. 
The main result we need now follows. 

Theorem 7 (JCCP for splitting trees. Theorem 3.3 and 4.3 in [19,). The JCCP 

for a chronological tree is cddldg and given by the expression 

Xt = -t^ C{v), 0<t<t 

Consider the law of a splitting tree which goes extinct almost surely. Then X 
has the law of a spectrally positive Levy process of finite variation, with a Laplace 
exponent 

POO 

ip{0) = 0- (1 - e-^^) A(dr), 6> > 0, 
Jo 

killed upon hitting zero. 
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For definitions of spectrally positive (or negative) Levy processes, please see the 
classic text by Bertoin [71 Chapter VII] from where we will use all the results we 
need in the following text. Note that when the tree is finite the JCCP is nothing 
but a compound Poisson process with drift. 

3. Preliminaries on point processes and subordinators 

Point processes can be, of course, treated under the rubric of the theory of 
random measures which is going to be our approach here. A standard reference to 
the theory is the book by Daley and Vere- Jones [8], Chapter 7. 

Let (X, p) be an arbitrary complete separable metric space equipped with its 
Borel cr-algebra Bx- A Borel measure /i on X is boundedly finite if < oo for 
every bounded Borel set A. Let JVx denote the set of all boundedly finite integer- 
valued Borel measures on X. Then there is a natural metric on JVx that turns it 
to a complete separable metric space. The corresponding Borel a-algebra is the 
smallest a-algebra with respect to which the mapping /i m(^) is measurable for 
all A e Bx- Finally we include an extra point in A/x to represent the empty 
measure. 

Definition 5. The following definitions and results are standard and can be found 
in, for example, [8 . 

(i) A random point process X on X is a probability measure on J\fx • The finite- 
dimensional distribution refer to the collection of laws of finite dimensional 
random vectors 



where n G N and Ai, . . . , are disjoint bounded Borel subsets of X. The 
law of N is characterized by its finite-dimensional distributions. 

(ii) Let u he di a-finite nonnegative measure on X. A Poisson point process 
(PPP) with an intensity measure is a random point process on X whose 
finite dimensional distributions are given by independent Poisson random 
variables with means (z^(Ai), . . . , u{An)) 

(ii) Recall that a Radon measure on a locally compact space is one that puts 
finite mass on compact sets. Consider a sequence of Radon measures {fin} 
on a locally compact space. We say that the sequence converges to a vaguely 
to a Radon measure ja if 



for all continuous / vanishing outside a compact set. It follows that if 
the underlying space is a complete separable metric space then a family of 
Radon measures {fia} is relatively compact in the vague topology if and 
only if {jiia{A)} is bounded for every bounded Borel set A. 

(iv) On a complete separable metric space weak convergence of random mea- 
sures (including point processes) is quivalent to the weak convergence of its 
finite dimensional distributions to the appropriate limit. 



{N{Ai),...,N{An)) 




We now state a classical result about convergence to Poisson point processes. 
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Lemma 8. Let {/i^, n e N} be a sequence of Borel probability measures on X. 
Suppose V is a a -finite measure on X such that njin converges vaguely to v . 

For every n G consider an iid sequence of random elements Y\ Y2 nj ^ • • • 
with law fin . Define a point process on R+ x X by 

CO 

where 5x puts a unit mass at x. Then the sequence {Zn-, n G N} converges in law 
to a PPP on IR+ x X with intensity measure LEB x v, where LEB stands for the 
Lebesgue measure on the positive half-line. 

Proof This is a standard result. Hence we only outline the proof. It suffices to show 
(see [HI Proposition 9.1. VII, page 275]) that for any bounded continuous function 
/ : X X ^ R that vanishes outside a bounded set, we have the following limit 



lim £^exp 



■^f{k/n,Xk,n 







= exp 


[-/7 




. Jo Jx 



(1 - e--^^^'^)) u{dx)dy 



Suppose, without loss of generality, f{yr) vanishes whenever y > 1. Then, the 
above limit follows by first proving it by approximating /c/n by , the kth smallest 
order statistics in an iid sample [/i, . . . , [/^ from Uni[0, 1], for which the limit is 
immediate, and then showing that the approximation is asymptotically negligible. 

□ 

We will be working with explicit densities and the following lemma will come in 
handy. The locally compact space mentioned in the next lemma will be often in 
our case the positive quadrant in R^ without the origin. 

Lemma 9. Let j/^, n > 0} be a sequence of positive continuous functions in a 
locally compact space. Suppose 

(i) lim^T^^oo fn{x) = fo{x) for all X. 

(ii) The family {/n, n > 0} is uniformly locally bounded. 

Let fin be the a -finite measure whose density is given by fn with respect to some 
base Radon measure. Then fin converges vaguely to fiQ as n tends to infinity. 

Proof. It is known (see [8]) that it suffices to show that if A is a compact set then 
fin{A) converges to fio{A)^ as n tends to infinity. But fn is bounded on A and hence 
converges in on A. This completes the proof. □ 

3.1. Poisson Additive Point Processes (PAPP). We introduce the concept of 
an Poisson Additive Point Process. This is a Markov process of point processes on 
R+ X X, where X is some complete separable metric space. We need the following 
definitions. 

Let {^5, 5 > 0} be the right shift operators on the first coordinate of R+ x X, 
i.e., 

6>s(x, y) = {x -\- s, for all s, x > 0, y ^ X. 
We need the following definitions. 

Definition 6. Let (A^, t G /) be a jointly-defined family of point processes on 
(0,00) X X indexed by some subset / of R+. Let At{-,X) denote the marginal 
measure of the first coordinate. A concatenation of the family (A^), denoted by 
^tei^t^ is an operation which is permitted only when almost surely 
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(i) each At{-,X) is supported on a compact interval [0,rt], 

(ii) either each A(-,X) has no atom at 0, or each A(-,X) has no atom on 

(iii) and, Xlg<^ < oo for every t > 0. 

When it is permitted, it is defined to be another point process Z on M+ x X such 
that for any bounded Borel subsets B C IR+ x X we have 



Intuitively one can think of {At) as a family of marked point processes on M+ 
with marks in X. The above definition means that we are simply concatenating 
the distribution functions of At{-,X) while retaining the marks on atoms. 

Definition 7 (PAPP). A Poisson additive point process, or PAPP, is a homoge- 
nous Markov process {Nf, t > 0} on the space of point processes on x X with 
the following property. For every t > 0, there is a Poisson point process Z on 
R+ x A x A/r+xX7 whose law depends on t, with the following property. Let / de- 
note the set of atoms of A, A/r+xx)- Consider the two derived point processes 
(y, W) defined by 



Then (F, W) is a pair of jointly defined point processes on x A and the regular 
conditional distribution of W given Y is the transition kernel at time t for the 
Markov process N. 

The previous definition is required to formalize the evolution of the age process 
of a splitting tree. Hence they have a natural relation with spectrally positive Levy 
processes. However, as we will see later, the PAPP is a more general concept and 
can exist even when there is no Levy process corresponding to it. 

We need a few more standard operations on point processes. 

Definition 8. Let {Bt^ t G /) be a countable family of point processes seen as 
random measures on a complete separable metric space (A, p). A superposition of 
the family is a random measure B such that for all Borel subset A C X we get 



Le., the set of atoms of B is the union of the atoms in the family (5^, t ^ I). 

Definition 9. Let A denote a point process xR+. A translation of ^4 by a vector 
{x^y) G is a point process which is a translation of the measure corresponding 
to A. We will denote this point process by A + (x, y). 

The necessity of the following definition will not make sense until later in the 
text. 

Definition 10 (The restriction consistency property). A family of point 
processes {Bt^ t > 0} on the state space x R+ is said to satisfy the restriction 
consistency property if the following holds. Consider any pair of indices to < ^i- 
Consider Bt^ restricted to the subset IR+ x [ti —to^oo). Then, there is a subset 
D C R+, such that the collection of atoms of the second coordinate under Bt^ 
restricted to D x is identical to the collection of atoms of the second coordinate 
of the restricted measure Bt-^ translated by (0, —ti -\- to). 




r = Z(.,.,A'm+xx), and W = eteiZ{t,X,-). 



B{A) :=J2MAy 



tei 
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3.2. The Stable(l/2) subordinator. A big role in this analysis is played by the 
Stable(l/2) subordinator and its associated point process. Hence we briefly define 
the process and list some of its properties. A more detailed account can be found 
in the book by Bertoin [7 . 

Definition 11. The Stable(l/2) subordinator is an increasing Levy process X 
taking values in [0, oo). It can be characterized by its Laplace exponent 

^(A) :=-ilog^(exp-X,) 

which is proportional to >/A. The jump distribution I{{dx) is proportional to 
li{dx) := x~'^l^dx on (0, oo). Thus its jumps are naturally associated with a Poisson 
point process on (0, oo) x (0, oo) with an intensity measure LEB x /i. We call this 
point process as the Stable(l/2) point process. 

4. Streets, mailman, and the address protocol 

A convenient space for embedding real trees is the Banach space of sequences 
that are absolutely summable. The following description goes back to Aldous' 
original paper [2 . 

{The representation of continuum trees) Let {ei, i G N} denote the standard 
basis of ii which is 1 in the ith coordinate and zero elsewhere. Let (L^) denote a 
sequence of positive numbers. We inductively define a real tree as follows. Let ui 
be the origin and let 

71 = lii + ei[0,Li] := {ui + xei, < x < Li} . 

Now we continue inductively. Suppose we have defined Tk and we select a point 
Uk-\-i from Tk and set 

7fe+i =TkU {uk^i + ek+i[0,Lfe+i]} . 

Let T be the closure in of the union Un>i7^ then T is a real tree with a length 
metric derived naturally from i^. 

To get a random real tree we randomly choose (L^) and (un)- For example, a 
beautiful result due to Aldous states that if (L^) are the interarrival times times 
of a Poisson process on with rate tdt and Un is is chosen uniformly at random 
from the existing tree structure, the resulting real tree is the Brownian CRT. 

The representation leads to the following coding of a continuum tree by a 
stochastic process. Let denote the set of nonnegative numbers and let 
denote the space of Borel measures on R+. We assign an address system to our 
tree which is formally the law of a (possibly random) discrete-time stochastic pro- 
cess whose state space is R+. We show below that every stochastic process that 
satisfies certain properties naturally codes a continuum tree, and conversely, for 
every continuum tree, one can find a (actually, infinitely many) stochastic process 
that codes the tree. We start with a definition. 

Definition 12 (Streets). A street is a triplet {v^I^R) where /, are nonnega- 
tive real numbers and is a finite positive discrete measure supported on [0,/]. 
Throughout the text we will refer to / as the length of the street, v as the measure 
on the street, and R as the clock. The distribution function of v will be denoted 
by V itself. In particular, v{a) refers to the v measure of the set [0,a]. 
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Figure 7. The 1st street 

Definition 13 (Mailman and the address protocol). Given a countable collec- 
tion of streets S = {{vn^ In^ Rn)^ ^ = 1, 2, . . .}, one can define a stochastic process 
in the following way. A pair of real-valued stochastic processes j^^yl^^^^n)? ^ — 
1, 2, 3 . . .} defined on a probability space (1^, P) will be called a mailman on the 
streets S if it satisfies the following properties. 

(i) Every is a Bernoulli random variable with 

p(^^ = 1) = ^^'-^^^ . 

n 

(ii) Given = 1, the law of is given by normalized to have mass one, 
while given = 0, we take = In- 

(iii) Moreover ess sup^^-^ An < oo. 

Thus the sequence (Ai, A2, . . .) is a random element in . The law of the mailman, 
considering it as a probability measure on will be called the address protocol. 
Finally, the plural of mailman will be mailmen. 

The way a continuum tree is coded by the address protocol follows exactly the 
representation. Recall Aldous' definitions of a proper /c-tree which is nothing 
but a rooted Cladogram with k leaves whose edges have a real length. 

Definition 14 (Proper /c-trees). Let {7l{k); /c > 1) be a family of random proper 
k-trees. For j < k let (L^, . . . ^Lj) be uniform random choice of j distinct leaves 
of lZ{k). The family is consistent if, for each 1 < j < /c < 00, the reduced subtree 
generated by these leaves have the same law as 

Definition 15 (Leaf-tight property). With the preceding notation, call the family 
(7^(A:), k>l) leaf-tight if 

min d (L^, L^) A 0, as /c ^ 00, 

2<j</c ^ 
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where d is the natural graph distance on the real tree. 

According a fundamental result by Aldous a continuum tree is uniquely specified 
by a consistent family of proper k-tiees which satisfies the leaf-tight property. We 
state the results below (a bit sloppily in order to avoid brining in new definitions 
and formalism). 

Theorem 10. [3 , Theorem 3] (i) Let (lZ{k)^ k > 1) be a consistent family of proper 
k-trees. Suppose the leaf-tight property holds. Then there exists a special continuum 
random tree (5, fi) with the following property. Let {Zi) he an exchangeable sequence 
directed by the random measure fi. Then, for each k, the reduced subtree generated 
6^/ Zi, . . . , Z/c has law lZ{k). 

(a) Conversely, for every continuum random tree, the above mechanism defines 
a family of random trees {JZ{k)) which is consistent and leaf-tight. 



The mailman defines a consistent family of proper /c-trees in the following man- 
ner. For every fixed k > I, let (A(l), (A(2), ^(2)), . . . , (^(A:)) be k iid 
realizations of the mailman. Consider first (^(l), (^(1)) and {A(2).,^(2)). Define the 
stopping time 

m = mm{n > : ^^(1) ^ An{2) or / c;^(2)}. 

Then, we build a proper 3-tree with a marked Root and leaves 1 and 2 according 
to the following recipe. Let v denote the most recent ancestor of 1 and 2. The 
distance between the Root and the vertex v is given by 

{m m ^ 

^Ml), ^A,(2) I. 
i=0 i=0 J 

The distance between the Root and the two leaves are given by (respectively) 

oo oo 

J2M1), and ^Ai(2). 

We can now proceed by induction. Suppose we have drawn a {j + l)-tree with a 
marked Root and labeled leaves 1, 2, . . . , j using (A(l), (^(1)), . . . , (A(j), ^(j)). Con- 
sider + 1), + 1)) and define 

mtj^i = mm{n > : An{t) ^ A^iJ + 1)} or (^^(t) ^ + 1), t = 1,2, . . . , j. 

Then we add a leaf labeled (j + 1) at a distance of Xl^o ^n{j + 1) from the Root 
such that the most recent ancestor between leaf (j + 1) and leaf t is at a distance 

{mt,j + i ^t,j + i "I 

^ A,(j + 1), Mt)\ 
i=0 i=0 J 

from the Root. Note that, this amounts to selecting such that mt*j-\-i = 
max^ m^^j+i, and attaching an edge at the most recent ancestor between leaf {j + 1) 
and of appropriate length. 

This gives rise to a family of random /c-trees which is consistent by construction. 
Note that the distance between leaf 1 and leaf j is given by 

oo oo ^ 

d{j, 1) := J2 M^) + E - 2min i E ^"(j)' E ^r^{l) \ 

n=0 n=0 ln=0 n=0 J 
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Note that every Ai{-) is supported on count ably many atoms. Thus, given the 
path of {A{l)^e{l))^ the iid random variables maxi<j</e m^^i tend to infinity in 
probability as k tends to infinity. Moreover, since ess sup^^ An(-) is bounded, 
we get mini<j</c d{j^ 1) goes to zero in probability. This proves that the family of 
random k-tiees generated by the mailman is leaf-tight. 
Hence we get the following result. 

Lemma 11. A countable collection of streets such that there is a mailman on the 
streets defines a unique law on the space of continuum trees. 

The reader might be wondering why we kept the variables This is due to 
a fundamental reasons for which we provide the following heuristic. The above 
construction is very similar to the string-of-beads representations in the recent 
papers by Pitman and Winkel [25] and Haulk and Pitman [17]. These are all 
related to the Chinese Restaurant construction of a random tree (please see the 
above cited papers). Recall the Chinese restaurant process for creating a random 
permutations (please see Chapter 3 in the excellent lecture notes by Pitman [24J). 
Any new customer can either choose one of the existing tables, or can create a 
new table on her own. As will become clear in the following text, this process of 
creating a new table is being taken care of by the variables For example, consider 
Figure [7l There are five subtrees growing on four internal vertices on the spine at 
a distance di^d2^ds^ and d^ away from the root. One can construct a street by 
associating five positive numbers with these five subtrees ai, . . . , 0^5. Then R = a^^ 
I = 6^4, and the measure u given by fi{{di}) = for z = 1, 2, 3, 4. 



We begin the discussion with some results on a standard Galton- Watson tree with 
emigration. Consider a continuous time binary branching process {Z^, t > 0} with 
rate two and an additional rate one of emigration. In other words, the generator 
for this process is given by 



Cf{x) = 2x [fix + 1) + fix - 1) - 2fix)] + [fix - 1) - fix)] , xeNU {0}, 



with an absorbing state zero. We denote the family of such process, starting with 
an given number of individuals, by GW( — 1). Let denote that the law of a 
GW(— 1) process starting with r individuals. 

Lemma 12. Consider the continuous time binary branching process with emigra- 
tion rate 1 starting with one individual. Let ctq be the first time the population hits 
zero. Then, the Laplace transform of cfq, under , is 



In particular, the mean of <Jo under P^ is one. 

Proof. We will use a variation of the generating function technique. For any non- 
negative integer r, define 



5. Galton- Watson trees with emigration 




CFr := inf {t > : Zt = r} 



where the infimum of an empty set is infinity. 
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Let Ur{t) denote the density of at time t when the chain starts from r + 1, 
and let Vr{t) denote the probabihty 

Vr{t) = {Xt = r, and X, > r, for ah s < t) . 

Now, for any r > 1, that event {Xt = r, Xs > r for ah s < t} can be decomposed 
into disjoint events where (i) either the chain stays at r throughout, (ii) or, for any 
choice of {u^ s) such that u-\- s < moves one step up to r + 1 at time li, returns to 
r at time s for the first time, and stays > r for the rest of the duration along with 
Xf = r. By the strong Markov property, one gets 



(5) Vr{t) = e-^^^+i)^ + f 2re-(^"+i)^ f Ur{s)Vr{t 

Jo Jo 



u)ds. 



Now, for a positive number ^, define the transforms 

pOO POO 

Fr{0) = / Ur{t)e-^^dt, Hr{0) = Vr 

Jo Jo 



{t)e-^^dt, 



Gr{0) 



4r 



Then, by Fubini-Tonelli, we get 



2rFrHrGr 



2r 



f 

Jo 



Vr{t) 



u-\-s-\-w=t 

-{4r-\-l)t 



e-^^dt, by ([5|, 



-et 



dt 



1 



r > 1. 



= Hr{e)-Gr{0). 

Rearranging terms from above, we get 

^ 1 - 2rFr{0)Gr{0) (4r + 1 + 6>) - 2rFr{0) ' 

Now, for any r > 0, one can do a last passage decomposition to the event 
{ar = t}, given {Xq = r + 1}, by considering the event that the chain remains 
> r + 1 during the time interval [0, t), is at r + 1 at time t— , and the next jump it 
makes is exactly at time t when it lands on the value r. More rigorously, for e ~ 0, 
we get 

rt+e 

Ur{s)ds = P"^^^ {t<(Jr<t^e) 



(6) 



= J2 P"""^ {Xt=r + k, X,>r + l,Q<s<t) {ar < e) 



fc=i 



By dividing the above expression by e and taking limit as e 0, we get 

Ur{t) = {2r + 3)Vr+i{t). 

Hence, 

2r + 3 



Fr{e) = {2r + Z)Hr+i{e) 



{Ar + b + e)-2{r + l)Fr+i{e) 
r + 3/2 



(2r + 5/2 + 9/2) - (r + l)F,+i(^) ' 
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By recursion of the the above identity we arrive at the following continued func- 
tion expansion: 

(7) p (0) = 3/2 I _ 5/2-11 _ 7/2-2 1 _ 9/2 - 3 | _ 
^ ' °^ ^ 1^/2 + 5/2 1^/2 + 9/2 1^/2 + 13/2 \ 9/2 + 17/2 

Now, it is known (see [21, p. 278, eqn. 3.3.3]) that the (complementary) incom- 
plete Gamma function 



>,z) := / 

J Z 



satisfies the following continued fraction expansion 

1| 1(1-^^)1 2(2 -a) I 3(3 -a) 



1 -\- z — a 



z — a 



a \ 7 -\- z — a 



z > 0, a eC. 



Putting a = —3/2 and z = 0/2 in the above expression and comparing it with 
([7| we immediately get 

3/2 



(8) 



Foie) 



J/2 



G) ^ 



(-3/2,0/2) 



2^8 



ge/2^3/2p(_3/2^^/2). 



We now apply integration by parts to the incomplete Gamma function for a 0: 



Tia,e) = 



1 



T{a+1,0). 



Thus 



r(-3/2,0/2) 



3 
2 

= r 



-e/2 



-e/2 



-3/2 



-3/2 



;r(-i/2,^/2) 



2e-^/2 



-1/2 



2r(l/2,6>/2) 



^^-9/2^-3/2 _ ^^-9/2^-1/2 ^ lr(l/2,^/2). 

3 3 3 



Since V^(i9) = Fo(6>), we get 



2VS 



fi/2^9/2 



2V8 



-9/2Q-3/2 



4^2 



^/26>-^/^ + -r(l/2,6>/2) 
3 



3 3 

v2 Je/2 
This completes the proof. 

Lemma 13. The density of 2ao under is given by the function 

(9) M^) = ^(i + ^)"'/', ^>o. 

Hence if we let L{u) = P^{ao < u), then L(u) := 1 - L{u) = (1 + 2u)-^/'^ 



□ 



Proof. We simply invert the Laplace transform of ctq as obtained in the last lemma. 
We work with V^(26>): 

E(e-^2"°) =^{20) = ^e^^3/2r(-3/2,^). 
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By applying scale and shift transforms to (|8| we get 
^(0) = te'd^l^ j e-H-^'Ht = Y e-'^u-^'Hu, t = 0u 

By the uniqueness of the Laplace transform this proves the formula for the density. 
The expression for L{u) follows easily. □ 

We also need an idea about the size of the tree conditioned to have survived for 
long. We start with a natural definition which will be useful later. 

Definition 16. Suppose that the GW branching processes started with one indi- 
vidual as time a G M and survives until time a + ctq. The age of the branching 
process is defined for times t e [a, a + ctq], the age at time t being t — a. 

Lemma 14. Consider the process {Zt^ t > 0), a GW{—1) starting with one indi- 
vidual Then 

E {Zt I cro > = 1 + 2t. 
In particular lim^^oo ^ {Znt/n \ ctq > nt) = 2. 

Proof. The process + 1 is a martingale as the function u{x,t) = x -\-t satisfies 

du ^ ^ 
at 

We apply the Optional Sampling Theorem at the stopping time t Aao to get 

E{Zt^,^) = l-E{tAcTo). 
The right side of the above equality can be evaluated explicitly thanks to Lemma 



E{tAao)= [ L{s)ds= [ — - 
Jo Jo (J- 



ds _ ^ 1 



25)3/2 vTT2t* 

Thus E{ZtAao) = 1/a/1 + 2t. Since Zt^ao = when t > ctq, and P((To > t) = 
(1 + 2t)-3/2 we get 

This completes the proof. □ 

It is a well-known result due to Yaglom that for the critical GW process, without 
emigration, the law of the conditional random variable Z/n given ctq > nt converges 
to an Exponential random variable as t is kept fixed and n tends to infinity. Hence, 
it is natural to expect that a similar limit theorem holds for GW(— 1). However the 
known proofs in the case of no emigration does not carry over to the present case. 
We present a different proof using martingales. 

Lemma 15. Consider the (generalized) Laguerre orthogonal polynomials 

(x) = -—- (e-^x^+") , X > 0, n = 0, 1, 2 . . . . 

Fix an X > and define the function 

f{n) = L^^lf{x), n>l, /(0):=lLT^(x)=0, 
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Then, if Z is a G W(— 1 ) process then 
(10) Mf 



t > 0, 



is a martingale. In particular, for x = Q, the process 

(11) 

is a martingale which is the scale function of the Markov chain. 

Proof. We recall the following recursions satisfied by the Laguerre polynomials: 

(12) L(")(x) = L(r'\x)-L^:X'\x) 

(13) n4")(x) = {n + a)L^:\{x)-xL^:X'\x). 

Please see Abramowitz and Stegun [1 for a proof. 
Now, for all n > 1, we have 

£/(n) = 2n [/(n + 1) - /(n)] + (2n + 1) [/(n - 1) - /(n)] 



= 2n [4^/2) (3.) _ 43/2) (a.)J + (2n + 1) 

= 2n4i/2)(x)-(2n+l)4^_/f(x). 

Here we have used the first of the two recursions. 
Now using the second recursion, we get 

Cf{n) = 2 



L 



(3/2) 
n-2 



{X) 



nLl^/'\x)-{n^l/2)L^^!_l\x) 
-2xf{n). 



-2xL^^!_l\x) 



Thus 



2xf{n)+Cf{n) = 0, n > 1, 
which proves that is a local martingale. The required integrability conditions for 
being a true martingale follows by comparison with the critical GW (no emigration) 
which is stochastically larger. 

The claim (11) follows from a standard formula of L^\{)). Note that gives 
the scale function of the Markov chain since it is increasing and vanishes at zero. 
Also note that when Zt is large, is proportional to Z^^'^ which is the scale 
function of the diffusion approximation stated below. □ 

We can now prove our corresponding version of Yaglom's theorem. 

Lemma 16. Let Z he a GW(—1 ) process. The sequence of laws of random varia^ 

Znt 

n 

converges to an Exponential random variable with mean 2t. 



(Jo > , n > 1 



Proof. Using Optional Sampling Theorem to the martingale Mf in (10) for the 
bounded stopping time t A ctq we get 

1 = ^Mf^,„ = Ee^^*fiZt)l{ao > t}. 

Since we know that P{cro > t) = (1 + 2t)~^/^, we can rearrange the last equation 
to derive 

(14) E\L%/^l{x)\ao>t]=( 



*(l + 2i)3/2. 
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Now an alternative series expansion of the Laguerre polynomials is the following 



e--4"^(x) = £(-ir 



i=0 



a-\- n -\- i\ X 



Putting expectations on both sides as in (14) we get 



i=0 



1/2 + Zt+i 
Zt-l 



ao>t 



x" 



Both sides are entire expressions in x, and hence by comparing coefficients we 



get 

+ i + 1/2 
Zt-\ 

Replacing t by nt we get 



E 



ao>t 



(l + 2t)^+^/^ i = 0,1,2,.... 



(15) 



E 



Znt^i^ 1/2' 
Zflt ~ 1 



(Jo > nt 



(1 + 2nt) 



i+3/2 



For the rest of the proof we will denote expectations with respect to the conditional 
law of by E\ 

By using Stirling's approximation formula 



r{az + 6) - V^e-^"(az)^"+^-l/^ 
we readily obtain that for fixed > 0, as n tends to infinity. 



ny^i ^1/2 
ny-1 



(ny) 



i+3/2 



r(i + 5/2)' 



Hence 



lim E* ^ 

n^oo \ 2nt 



i+3/2 



r(i + 5/2). 



The algebra generated by the class of functions {1} U {y^^'^ • ^ > 0} can 
uniformly approximate any continuous function on compact sets. Hence an argu- 
ment similar to the moment method identifies the limiting distribution of Znt/2nt 
uniquely as Exponential with mean one. This completes the proof. □ 

Finally we state a standard diffusion approximation lemma. 

Lemma 17. Let Z he a GW(-l) starting with nx individuals. Then, as n tends to 
infinity, the rescaled process 

^-ZnU t > 

converges in law to a BESQ process with drift —1 starting from x. 

For a proof (without the drift term) see Chapter 9 on Ethier and Kurtz [12]. 
The proof with the drift is almost identical. The following collection of results can 
be found in the article by Going- Jaeschke and Yor [15^ . 
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J 

Root A 

n n 

I r T T Q 

Root D C B A 

Figure 8. Evolution of a street 

Lemma 18. Let denote the law of a BESQ process of dimension ^ G M and 
starting from x > {), until it hits zero. 

For any 6 > —2 and any x > {), Q~^{Tq < oo) = 1, while, for 6 < 2, one has 
Q^(To<oo)=0. 

Moreover, for <2, we have 

(i) To is distributed as x/2G, where G is a Gamma random variable with pa- 
rameter (^/2 + 1). 

(ii) The transition probabilities y) for x^y > satisfy the identity 

As a final remark note that these transition probabilities are explicitly known as 
non-central chi-square distributions. 
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Root D C B A 




Figure 9. Evolution of a street 



6. Evolution of a street 

Consider the Poissonized Markov chain as described in the Introduction. We 
consider a spine, i.e., a branch, in the tree that supports several subtrees rooted 
on it. As the Poissonized chain runs its course, these subtrees either continue to 
exist or vanish at some point, and new subtrees appear supported on the spine. To 
distinguish the time according to which the Markov chain runs, we will call this 
indexing set to be level Thus the Poissonized chain at level a would mean the state 
of the chain at the end point of the time interval [0, a]. 

Our purpose in this section is to describe, at any level a, the respective sizes 
of the sequence of existing subtrees supported on the spine (if any) as we move 
away from the root. Of course, this depends on the initial condition. We begin our 
discussion starting with a spine that supports just one root and two subtrees (see 
the top image in the Figure [8| . We call this a sapling. The only internal vertex is 
marked by A. As long as this internal vertex exists, it corresponds to an internal 
edge to its left where, at rate one, new rooted subtrees can grow. At some point of 
time the subtrees can look like the second image in Figure [S] Rooted subtrees have 
grown at internal vertices A^B^C and D. There are five internal vertices including 
the root with equal edge lengths between them. There are five subtrees growing 
from these internal vertices, one each from vertices C, and two from A. Call 
the rightmost subtree growing from vertex A the clock subtree. 

Note that each of the internal vertices correspond to an internal edge on their 
left on which new leaves sprout at rate one. Hence one can associate a natural 
family order among these subtrees where we say subtree 1 is a child of subtree 2 if 
subtree 1 sprouted on the internal edge that was on the immediate left of the root 
of subtree 2. Clearly each existing subtree produces children at rate one as long as 
it is rooted at an internal vertex. As we have argued before, the number of leaves 
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of these subtrees follows a GW(— 1) process, and hence the lifetime of each subtree 
is given by the distribution L in Lemma [13] 

Note that the internal vertex A is special since two rooted subtrees have grown 
from A. In all these images the rightmost clock subtree has a special status. This 
subtree has no internal edge between itself and the next subtree on its left. Hence, 
this subtree does not produce any child. Naturally the distribution of lifetime of 
this tree is also given by L. When this subtree dies out, the internal vertex to 
which it is attached dies too. Note that, this internal vertex may not be A, since 
the subtree that started with leaf J (see previous figure) could have vanished earlier. 
However, it is some internal vertex, and possibly the Root itself. 

Suppose, this is not the Root, but some other internal vertex. In that case the 
resulting picture is stochastically similar. In Figure [9] the top image represents 
the street the moment before the clock subtree attached on the right of A dies. 
According to the rule of the Markov chain, the internal vertex A disappears, and 
after some edge rescaling, the vertex B becomes the final internal vertex which 
gives rise to two subtrees out of which the rightmost one is the new clock subtree 
that does not produce any children. 

This process continues until no internal vertex is left, save the root, and ulti- 
mately every leaf dies. 

Streets, as defined in Definition [T2j arise naturally in this context. Consider the 
sapling in Figure [Sj As we will see later, we will attach positive numbers aj and 
a J called the ages of the subtrees / and J. Then a street {v^ /, R) is defined as 
follows. There is only one internal vertex other than the root. Hence we take / = 1, 
corresponding to the fact that this internal vertex is at a unit distance from the 
root. The measure v has only one atom, at 1, and ^^({1}) = olj- The clock R is 
also taken to be aj. 

As the Poissonized chain evolves, so does the street. Suppose at level a we arrive 
at the configuration given in the second image in Figure |8] Then, at level a, the 
street is given by the following. There are four internal vertices including the root 
with equal edge lengths between them. There are five subtrees growing from these 
internal vertices, one each from vertices C, and two from vertex A. For each 
of these subtrees, other than the clock subtree, we define the their ages as follows. 
If the root of the subtree did not exist at time zero (i.e., the internal vertex did 
not exist), the the age of the subtree is given by the difference between levels a and 
the level at which that internal vertex was born. If the internal vertex existed at 
time zero, then its age is exactly a. For the clock subtree, the age is the sum of the 
amount of time it is the clock subtree plus its age just when it became the clock 
subtree (i..e, the then current clock subtree vanished on its right, or that it was the 
clock subtree to begin with). 

Let Qfi, . . . , as denote the ages of the subtrees growing arranged in order away 
from the root. Then, to define the street, we take / = 4, R = a^^ and u is given by 

= ai, i = 1,2,3,4. 

It is now intuitive that the evolution of the street starting from a sapling plays 
the role of an excursion of the evolution of the street under the general Poissonized 
Markov chain. Our aim is to describe the limiting excursion measure of this process. 

We remark at this point that this definition of the street will be slightly altered 
in the following text when we will scale edge lengths. This will be refiected in the 
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Let us forget for the moment that the rightmost subtree does not produce chil- 
dren. In fact, consider that the subtree / in Figure [S] never dies. In that case, 
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every subtree that grows survives until a random time whose distribution is L, and 
during that hfetime produces children at rate one on the left. The resulting process 
is nothing but a splitting tree, starting with one individual, and with a lifespan 
measure given by L, except that the children are produced to the left. This, how- 
ever, can be easily corrected by reversing our direction of viewing and looking at 
the subtrees from the end away from the Root. A typical JCCP is the top image 
of Figure fTo] 



Let us now explain the sequence of images in Figure 10 The top most image 
represents the JCCP of the splitting tree if we remove the constraint that when 
the rightmost subtree on the street dies whichever is then the rightmost does not 
produce any children for the rest of its lifetime. Since we are looking at the tree 
from its rightmost end, the leftmost jump in the top image (the singleton jump that 
is not connected with the rest of the image) of the JCCP represents the lifetime of 
the subtree that started with the subtree /, the first clock subtree. The rest of the 
image is the JCCP of a splitting tree with one ancestor and a lifetime distribution 
given by L. Since the mean of L is one, the resulting splitting tree is critical and 
hence the JCCP hits zero almost surely. 

The second image in the sequence has two types of lines, solid and dotted. The 
dotted lines represents the lineages in the tree that do not exist since the rightmost 
clock subtrees on the street do not produce any children. The horizontal dotted 
lines represent the death times of the current clock subtree. Note that a subtree 
could have produced children before it becomes the clock subtree that does not 
produce any child. 

The third image represents represents the JCCP when the dotted parts are 
removed. This gives us, at every level, the age process of the current survivors 
and their distances from the root, and the age of the current clock. Suppose that 
the initial two jumps are given by (Vb, Vi), then at every level we obtain a street. 

The fourth image displays that the third image corresponds to a sequence of 
excursions of the underlying Levy process that defines a JCCP starting from a 
level and ending when it crosses another level. More precisely, define the running 
maximum process in a usual manner. The nth excursion starts at the end-point 
of the (n — 2)th excursion and ends when it jumps across the level of the current 
maximum. This continues until an excursion hits zero and we stop there. To make 
the distinction of the usual JCCP and the new one, we will call it the reduced 
JCCP. Without being pedantic we write down the following definition. 

Definition 17. For a finite splitting tree, a reduced JCCP is a path obtained 
from a full JCCP corresponding to the evolution of a sapling (Figure |8| following 
the rules described in Figure [lOj The initial pair of jumps that forms the sapling 
(Vo, Vi) will be called the initial parameters of of the reduced JCCP. 

Our underlying intuition is the following. The sequence of jumps above the 
current maximum for the full (not reduced) Levy process which runs at all time is 
the ladder-height process of another Levy process which is a Subordinator. We will 
show that this process converges to the Stable(l/2) subordinator which will give us 
the ages of the clock subtrees. Moreover, given the start and the end levels, each 
excursion is independent of the past and produces a limiting Poisson point process 
of ages across various levels. The final age process is a superposition of the point 
processes from various excursions which gives us the measure on the street. 
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As the Poissonized Markov chain proceeds, the distribution of leaves in each 
subtree can be described by describing the evolution of the triplet that consist the 
street. The scaling constant will be kept fixed throughout the evolution of the 
chain. Finally, note that the measure on the street can be empty (i.e., / = 0). 

Now, consider the evolution of streets as displayed in Figure 4. The last result 
shows the limiting age process of the family as it evolves under the Poissonized 
Markov chain. However, for the final picture, because of the way we have drawn 
streets in Figure 4 (Evolution of a street), we need to time-reverse our point pro- 
cesses at every level a to get place the root at the left-most end and such that the 
rightmost subtree does not reproduce. 

6.1. Infinite splitting forest with lifespan measure L, We start with the 
following set-up. Consider the probability distribution of the time to extinction of 
a Galton- Watson tree with an emigration rate one. Explicitly, as derived in the 
previous section, 

I(ii) = l-L(ii) = (l + 2ii)-^/2 

Let Nt denote a Poisson process with rate one and let Ci ? C2 , • • • be iid distributed 
according to L. Then, define 

Nt 

(16) X, = -t + ^0, ^>0, Xo = 

Thus X is a spectrally positive Levy process whose Laplace exponent is given by 
Lemma [121 

poo -< 

(17) ^{6) =6-1^ / e-^"L(dr) = ^^^/'e^/'r (1/2,^/2). 

If we consider the process X whenever it is nonnegative and including the jumps 
crossing zero, the process is the JCCP of a countable forest of binary trees which 
are all attached at the root to the half line. At time zero, a countable number of 
individuals already exist with their respective ages. The Poissonized Markov chain 
is allowed to run on this forest where new trees can grow on the half-line in between 
two existing trees. For one individual as an ancestor we would stop this process 
once we hit zero. However, for this forest we would consider the process running 
for all time. 

At any point in time, the ages of the trees, when indexed by the order they 
appear, away from the origin can be thought of as a point process on the half-line. 
As time moves, this becomes a Markov process on the space of point processes. In 
this section we identify the limiting Markov process as a PAPP. Note that, since L 
is a probability distribution with mean one, the resulting forest is critical. 

The main subtlety in doing this is that, by Lemma [13) there is no proper scaling 
under which the contour process X converges to a spectrally positive Levy process. 
However, as we will show, the age-process does converge. This will be achieved by 
a combination of classical Wiener-Hopf techniques and results on ruin probabilities 
of spectrally positive Levy processes. See Bertoin chapter VII]. 

We start with a lemma which follows since X is a martingale. 

Lemma 19. The process Xt is point recurrent. 

Definition 18. Let us make the following definitions which we will follow in the 
rest of the article. 
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(i) For any spectrally positive Levy process X, we will denote the dual process 
—X by X, which is now a spectrally negative Levy process. 



(ii) We consider a Poisson point process as in (16) with a continuous jump 
distribution L. Consider a nonnegative level a. Define stopping times 

Qa = inf {t > : = a} , Ta = inf {t > : Xt > a] . 

Each Ta will be called an upcrossing of level a, while each Qa will be called 
a downcrossing or a return to level a. Since X makes only upward jumps, 
it is clear that, almost surely, any two occurrences of Qa contains a unique 
occurrence of r^, and vice- versa. 

(iii) We will call the random variable X^^ — a to be the overshoot at level a and 
the random variable a — X^^- to be the undershoot at level a. 

(iv) When Xq = a, the law of the overshoot / = X^^ — a is called the ascending 
ladder height distribution and is clearly independent of a. 

(v) For the above process X, we denote by X^ the process obtained by time- 
reversal at Qa^ i.e., Xf := a — X(^^_^)+, t > 0. The following equality is 
easy to see for arbitrary random walks: the law of the process X^ in the 
time interval [0, ^o(-^^)] is the same as the law of X — a in the interval 
[0^ Qa] when Xq = a. In particular, the random variable J = a — X^^- also 
has the ascending ladder height distribution. 

We now define the scale function of the process X. 

Lemma 20. Let W : [0, oo) [0, oo) he the unique continuous increasing function 
with Laplace transform 



(18) / e-^''W{x)dx 



f 

Jo 



1 _ V20-^/^ 
(p{0) ~ e^/2r (1/2,^/2)' 



Then for every x^y >0, if we define cjx^y to he the stopping time ax,y = inf{t > : 
Xt ^ [-y,x]}, then 

^ yi W{x + y)- 

Moreover, W satisfies the following properties. 

(i) W(0) = 1 and, asymptotically as x tends to infinity, we get W{x) ~ 

3 ^1/2 

( i i ) W L = 1 -\- W, where ^ denotes the convolution operator. 

(iii) W admits a continuous density on (0,oo). 

Proof. The process X as defined in the above lemma has only downward jumps 
(i.e., spectrally negative). The ruin probabilities of such a process is given by a 
scale function. The first claim in the lemma is hence a corollary of Theorem 8 in 
p. 194]. 

Proof of (i) : For the asymptotics consider the Laplace transform of the function W 
as approaches zero. The function r(l/2,6>/2) converges at 0+. Hence, it follows 
that 



/ 2 

/ e-^^W(x)dx- W-6>-^/^ as 0^0 

Jo V TT 



In other words, the Laplace transform of W is regularly varying at zero with index 
—3/2 and a slowly varying function given by a constant. 
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We now apply a well-known Tauberian theorem for monotone densities (note 
that W is an increasing function). See, for example, [71 p. 10] to infer that 



(19) W{x) ^ \l --x'^^ ^ ^=x'^\ as x^oo. 

TT 2 v27r 

Similarly, as approaches infinity, we get (see [1, p. 263]): 
r(l/2,^/2) - (^/2)"'/'e-^/' 

Hence 

e-^^W(x)dx - 6>-\ as ^ oo. 



An application of the same Tauberian theorem and appealing to the continuity of 
W at 0+ completes the proof of the lemma. 

Proof of (ii) : Note that by usual transformation rules of Laplace transforms 

) 1 
e-^''W{x)dx : 



^{6) 1 - e-^^L{x)dx 
By rearranging terms we get (ii). 

Claim (iii) follows from (ii) by a convolution series expansion. □ 

Lemma 21. Fix an initial level Xq = a > 0. Then the random variables J a = 
a — Xr^- and la = X^-^ — a both have a density on (0, oo) given by (1 + 2s)~^^'^ , 
i.e., 

{la >u) = P\Ja >u) = G{u) := Jl^ , u>{). 

Vl + 2ii 

More generally, suppose Xq = 0^ then the joint density of {la-, J a) is given by 
(20) P° {la e du, Ja e dv) = L'{u + v) {W{a) - W{a -a^v)^l{v> a}) dudv, 
where L' is the density of the jump distribution L. In particular, 

{Ja e dv) = {W{a) - W{a -aAv)^l{v> a}) L{v)dv, and 
^'^^^ P^ {lo + Jo > x) = xL{x) + G{x), x>0. 

Proof of Lemma [2Tj Since the law of the jump are given by L, the density of the 
ascending ladder height is given by L. This follows from the classical Wiener-Hopf 
factorization and can be found in, for example. Theorem 5.7 in |6l 237]. 

For the joint distribution we use the expression derived in Doney and Kyprianou 
in See Example 8 (Spectrally positive proesses) on page 9 (use q = and 

U = W and integrate over the third variable). 

P^ {la e du, Ja e dv) = yL'{u + ^) y W{a - dy)j dudv 

= L'{u + v) {W{a) - W{a - a A v) + l{v > a}) dudv. 

Here we have used the fact that W has a mass of 1 at zero and after that it has 
a continuous density. The expression is consistent with the marginals when a = 0. 
The final claim follows by trivial integrations. □ 
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(22) ^a = PM inf X, >0 



Now, consider the sequence of returns of the process X to the level a. When 
X is the JCCP of the splitting tree, several such returns are performed until the 
process X hits zero and gets killed. By the Markov property, the law of the process 
in between each such return is iid. Hence the number of such returns is determined 
by the probability 

^a = P"f inf X, 
Lemma 22. The required probability discussed above is given by 

^o<i<Qa^^' ^ W{a)' 

Moreover, 

(i) the following holds for any two levels < ao < ai . For any < x < ai we 
have 

(23) 

P"" (ja, e dx, inf Xt>0]= P° (J„,-ao e dx) - ^1^; {Ja, G dx) . 

\ o<t<Ta-, J W[ai) 

(ii) Hence, when oq = ai = a, the conditional density of J is given by 

(24) 

rJv) := ^P" ( J„ G dv I inf Xt > o] = ^P" ( J e dv \ inf Xt > 

dv \ o<t<T„ J dv \ o<t<ea 



W{a) 

(iii) As n tends to infinity 



(25) gl{v) := lim V2n^ rna{nv) = 1 - \ v''/^, 0< v < a, 

exists as a rate function on the interval (0,a). 

Proof Clearly Qa is the probability that, starting at a, X exits the interval [0, a] 
through a. By a translation and using Lemma [20| we get 



W{a) W{a) 
For claim (i), we note the following. 



Ja, e dx, inf Xt>0] = ( J„, G dx, go > t^, ) 

V 0<t<T„j J 

= P"" [Ja, e dx) - P"" {Ja, e dx, 00 < T„J 

= (Ja, e dx) - P''° {go < Ta,)P° {Ja, G dx) 

= P° {Ja^.ao G dx) - ^^^^~"°^ P° ( Ja, G dx) . 

For claim (ii) we take = ai = a > 0. Note that the density is supported 
on < V < a. Thus, from the above expression and using Lemma [21] we get the 
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density to be 

ra{v) = 



L{v)-^^{W{a)-W{a-v))L{v) 



As a sanity check, note that from Lemma |20j claim (ii), we get 

which shows that is indeed a probabihty density. 

For (iii) we simply use the asymptotics of the W and L functions. □ 

We now need some asymptotic estimates. 

Lemma 23. Consider the Stahle(l/2) suhordinator, i.e., the increasing Levy pro- 
cess with a jump distribution given by Il{dy) = y~^^'^ , y > 0. Let a^u > 0. Then, 

(i) then the overshoot distribution, 1^^^ := X^^^ — na, satisfies 

lim P'(/r_ >nu)=Ha{u), 

n^oo 

where Ha{-) := 1 — Ha{u) is the distribution of the overshoot above a level 
a for the Stable(l/2) subordinator. 

(ii) Moreover, the joint density fn of {Ina/^i Jna/^) under admits the fol- 
lowing pointwise limit on (0, oo) x (0, oo).* 

lim /n(i^, v) = — 7=('^ + v)~^^'^ i\fa — \/a — a /\v) . 
n^oo 2v27r 



(iii) As n tends to infinity, the marginal density of Jnal^ under converges 
as follows 

3 

lim hn{v) = ha{v) := -—v~'^^'^ {\fa — y/a — a Av) , ^ (0, oo). 

n^oo 47r 

(iv) Consider two positive levels < ao < ai. Consider the density of J^ai/n, 
under the conditional probability P'^^o (• | info<^<r^^ Xt > O) . Then, for 
V G (0, oo); the sequence of densities converges pointwise to 



Ki^) = 1- - 



^/al - ao 
Jax 



ai 

Proof. We first consider claim (i). Consider the process X starting at zero and con- 
sider the supremum process Mt = supo<5<^Xs. Then, plainly, M is an increasing 
process that increases only when the process X jumps above the level of the cur- 
rent supremum. Hence, its jumps are iid with a distribution given by the ascending 
ladder height distribution G in Lemma 21 However, it is not Markovian, since the 
inter-arrival times of these jumps, although iid, are not Exponentially distributed. 

However, for any positive level a, the overshoot distribution, M^-^ — a, does not 
depend on the distribution of the interarrival times. Thus the overshoot distribution 
is the same as the overshoot distribution above a for a discrete time random walk S 
with a increment distribution G. Thus, let ^1,^2, • be an iid sequence of random 
variables each with distribution G and define 6*0 = and Sn = £,i -\- - - - (.n- Then 
Sr^ has the same distribution as M^-^ which has the same law as X^-^. Thus the 
overshoot distribution is given by the law of 5'^-^ — a. 
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Since G(u) is asymptotically (2u) it follows that the rescaled process 



(26) >^/"^ = ^^u^|, ^>0 



1 



n 



converges in law (in D[0, oo)) to a Stable(l/2) subordinator Y. The result now 
follows by using standard arguments. 



lim fn{u,v) = -{u^v) —= [yfa - Va - a Av ^ lim n ^^'^l{v > a}] . 

n^oo 2 \/27r ^ n^oo / 



Now for claim (ii) we use the explicit joint density of (/na, Jna) from Lemma 21 
fn{u, v)=n^^{l^ n{u + v))-^^^ {W{na) - W{n{a - a A v)) + l{v > a}) . 

Now taking n going to infinity and using the asymptotic properties of the function 
W in Lemma [20] we get 

3 / X _^ /o 3 

This completes the derivation of (ii). 

Claim (iii) is very similar and can be seen by integrating out u from (ii). We use 
the explicit expression of hn from Lemma [21] 

lim hn{v) = lim n(l + 2nv)"^/^ {W{na) - W{n{a - a A v)) + l{v > a}) 

n^oo n^oo 

(2v)"^/^ [^/a - y/a- a Av) = (V^ - Va- aAv) . 



27r ^ ^ 47r 



Claim (iv) is a direct consequence of claim (iii) via Lemma 22 claim (i). This is 
because 

^""^ e dv I Tna, < Qo] = ^7 ^----P^^^ (j^^Jn G dv , iuf X, > 



Taking limit as n goes to infinity and using (iii) above we get the limit as 



^ y^ai — ao 



^ai-ao(^) ^ ^ai(^) 



/ai 

This completes the proof of the lemma. □ 

6.2. The age process. Consider now the process {X^, < t < oo}, where Xq = 0. 
We are going to construct a family of point processes, called the age process, which 
will be indexed by levels a > 0. 

Definition 19. (The age process for the infinite forest.) Consider the se- 
quence of successive upcrossings of a level a > 0, i.e., ra(l), ra(2), . . ., and their 
corresponding values of the undershoots Ja{i) = a — Then the point pro- 

cess defined by {(/c, Ja(/c)) , k = 1,2,...} seen as a family of point processes indexed 
by the level a is called the age process. 

The rescaled age process will be the family of point processes 



(27) Age„(a):=^,5, 
scaled by a factor n. 



{k/V2^,Jr,aik)/n) 

k=l 
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For every level a, the sequence {Ja{k)^ k e N) is a sequence of iid random 
variables with distribution function G via Lemma [2l] Since, for any > 0, we get 

lim V2nG{nu) = 

n^oo 

the sequence of measures defined by 

fin ([^,00)) G{[nx, 00)) 

satisfies that \f^iin converges vaguely to the a-finite measure 00) = x~^l^ . 

By Lemma |8] the rescaled age process converges weakly, as n tends to infinity, 
to a PPP on x R+ with an intensity measure LEB x /i. We sum this up in the 
following lemma. 

Lemma 24. Fix a level a > 0. Consider the sequence of scaled age-process at level 
a, i.e., {Age^{a)^ n G N}. Then, as n tends to infinity, the above sequence converges 
in law to a Stahle(l/2)- PPP, Age{a), on [0, 00) x [0, 00) with rate LEB x ja, where 
00) = 6-1/2. 

Thus the process {Age^(a), a > 0} has marginal convergence to the Stable(l/2)- 
PPP. Now we ask the question of joint convergence. One can prove (as shown 
below) that the finite-dimensional distributions of this Markov process, after suit- 
able rescaling, converges to what turns out to be the finite-dimensional distribu- 
tions of a Stable(l/2)-PAPP. Kolmogorov's consistency theorem then establishes 
the existence of a limiting stochastic process of point process whose every marginal 
distribution is a Stable(l/2) point process. 

Theorem 25. Consider {Age^{a)^ a gM.} as a stochastic process of point processes 
indexed by a. Then there is a limiting Stable {l/2)-PAPP, say {Age{a)^ a G M.}, 
such that the finite dimensional distributions of Age^ converges to the finite di- 
mensional distributions of Age. Moreover, {Age{a)^ a > 0} satisfies the restriction 
consistency property in Definition\iO\ 

Proof. Let us start with convergence of a pair (Age^(ao), Age^(ai)) for some pair 
of numbers ao < ai. By translation, we can assume that ao = and ai = a > 0. 

Note that, every upcrossing above level zero has a chance of producing a finite 
number of upcrossings above level na before the process returns to zero. Let's 
denote by J the undershoot at level zero (i.e., — and H to be the finite point 
process (possibly empty) of undershoots at level a associated with one excursion 
above zero. By the Markov property of X, for each excursion above zero, we get iid 
copies of (J, n) where the H's get concatenated to give us the age process at level 
a. By Lemma [sj one can expect the iid sequence of (J, H) labeled by their indices 
to converge to a PPP on the product space of and the space of marked point 
processes from which the conditional law of Age (a) can be recovered by taking a 
projection. 

To do this precisely, we keep track of five random elements associated with one 
given excursion above level zero: 

(i) Jq- The value of the undershoot at level zero. 

(ii) Iq - The value of the overshoot at level zero. 

(iii) Ca - The indicator which is one if the excursion above zero goes above na. 

(iv) Jna - The first undershoot at level 71(2, in case Ca — 1, otherwise 0. 

(v) Zn - The rescaled age process of additional undershoots at level na in case 
ea = 1, otherwise 0. 
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The joint distribution of all these five elements can be described as below. 



(i) The joint law of (/q, ^o) is given in Lemma 21 

(ii) Given (Jo,/o), we now find the distribution of e^. Note that is 1 with 
probability 1 if /q > na. If Iq = nb < na^ the probability that the process 
X starting from Jq will reach zero before na is given by W{na — nb)/W{na). 
Thus, compactly, we can write 

P(e, = I Jo,/o = nb) = l-P{ea = l \ JoJo = nb) 

W(na — nb) , ,,,, , 
— — where W(x) = 0, x < 0. 
W{na) 

(iii) Now suppose we are given Jq and Iq < na^ and = 1. For n large, the 
law of Jna is given in the limit by the conditional law described in Lemma 
[23] (iv). 

(iv) The rest of the age process at level a, given {e^ = 1}, is given by a sequence 
of iid undershoots, distributed as r^, indexed by {1, . . . , A/'a} where Na is 
distributed as Geometric (1 — Qna)^ and is independent of everything else. 

Consider the joint distribution /i of {Jq, Iq, Ja^ Z) as a probability measure on 

1] := R+ X R+ X {0, 1} X R+ X A/'r+xr+. 

Now let jiin be the joint distribution of {Jo/n^ lo/n^ Cna^ Jna/n^ Z) (note that Z is 
already rescaled). That their joint density converges to the density of a a-finite 
measure u on Q can be established by multiplying their marginal and conditional 
densities as given by various parts of Lemma [23] Then, by Lemma [9] it follows 
that ^/2rl|JLrl converges vaguely v (the verification that the densities are uniformly 
locally bounded follows easily from the explicit expressions). The description of v 
follows from each of the limiting distributions, in particular, Z converges to (either 
the empty measure, or) a PPP of an independent Exponential length. 

The existence of a Poisson point process on x ^ with a rate function LEB x u 
follows from Lemma [Sj The age process at levels and a can now be recovered 
from this limiting point process by taking suitable projections. 

A repetition of the same argument produces the limiting age process at any 
finitely levels < ai < . . . < ak- The consistency of the finite-dimensional dis- 
tributions before taking the limit is preserved under vague convergence. By Kol- 
mogorov's consistency theorem we have established the existence of the limiting 
stochastic process {Age(a), a G R}. 

Finally, the restriction consistency property is a finite-dimensional property that 
is obvious for Age^ and is clearly preserved under the limit. □ 

We will need to enlarge the scope of the last theorem slightly. For any x, > 
consider the Levy process X, starting from 0, until the exit time dxy of the interval 
{—ny^nx). Then the limiting age process can be similarly defined for this stopped 
process. 

Theorem 26. For any < y < x, let X^'^y denote the stopped process X , which 
starts at ny, and stopped once it exits the interval (0,nx). Construct the process 



Age^ as in Definition 19, ^4/50 define 

4"^ = ^ {X.. 



n 
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Then there is a limiting stochastic process of Point process H^'^ and a limiting 
ransom variable such that the joint distribution of (Age^^Xx^^), under this law, 
converges in law to {H^^y ^X^) as n tends to infinity. Moreover the conditional 
distribution ofX^, given X^ > 0, admits the following description. Let S denote a 
Stable{l/2) subordinator, starting from y, and let 1 — Ha denote the distribution 
function of the overshoot above level a as defined in Lemma [23} Then, for all b > 0, 
we have 



p^y {Xx >b\Xx>^)=\i 



x-y 



H.-y(b) 



X-y 



Hx(b) 



Proof. We skip the details of the convergence of the age process which are exactly 
the same as in the previous case. 

The description of the conditional law of X^ follows by an argument similar to 



the proof of Lemmas 22 



P"^ ( Inx e du, inf > = {Inix-y) ^ du) - EM^y)lpO (/^^ ^ du) . 



Taking limits and using Lemma 23 (i), we get 



P(Xx>b\Xx>^) 



This proves the lemma. 



x-y 



-Hx{b) 



□ 



6.3. Evolution of a street. We now come to the main result of this section. 



Consider the reduced JCCP (Definition 17) in Figure 10, in particular the final 
image. Note that the reduced JCCP has two initial parameters (Vb,Vi) and a 
sequence of excursions. The two initial parameters determine the first two jumps 
denoted by (Vb,Vi). These, in turn, determine the following: Vi determines the 
level at which the first excursion starts and max(l^,Vi), determines at what level 
we stop the excursion (unless it has already hit zero). Our claim is that, conditioned 
on the values of {Vq^ Vi) being suitably large, the process of streets of the reduced 
JCCP has a limit. 

Definition 20. For any level a > and a scaling parameter n G N, consider the 



reduced JCCP as in (27) with the initial parameters nvo^nvi > 0. Consider the 



sequence of successive upcrossings of a level na > in reverse order from right to 
left. We separate the undershoots below level na in two parts, the left most jump 
(the rightmost in the reverse order), which corresponds to the clock subtree, and 
the rest of them ordered from right to left. Define the rescaled street at level a, 
denoted by Street (a), to be the triplet 

Streetn(a) = {un{a) Jn{a) , Rnict)) , 

where 

(i) nRn{a) is the age of the clock at level na which is given by the length of 
the rightmost undershoot below level a (if any). 

(ii) ^/2nln{a) is the number of upcrossings across level na, not counting the 
clock. In particular it can be zero. 
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(iii) Finally v is the point process defined by 

V2n/^(a) 

Streetn(a) := ^ ^{k/V2^,Jr,aik)/n)^ 
k=l 

where Jna{') are the undershoots below level na arranged from right to left. 
In case the jump exceeds na, the jump corresponds to rightmost of the two 
initial jumps, and we define Jna{') to be the sum of a and ai. 

Theorem 27. Consider the sequence of processes of streets starting with {Vq = 
nvo^ V\ = nv\). Then there is a limiting family of random elements 

StreetyQ^y^{a) = {(i/(a), /(a), i?(a)), a > 0} 

such that, as n tends to infinity, the finite- dimensional distributions (i.e., consid- 
ering finitely many levels) of the rescaled street process, with initial parameters 
(nvo^nvi), converges weakly to the finite dimensional distributions of the said lim- 
iting family. 

Moreover, the point process {iy{a) fl {I {a) + 1, R{a))^ a > 0} satisfies the restric- 
tion consistency property in Definition\iO\ 

Remark. Adding the atom (/(a) + l,i?(a)) is arbitrary. What we want to say is 
clear. If the clock subtree has survived through two levels ao < ai, then one would 
find a jump across level ao such that it also crosses level ai. 

Proof. This proof is now easy since there are no rate functions involved. Suppose 
that the initial parameters are {nvo^nvi). Now consider a sequence of excursions, 
as in the reduced JCCP. Let xq = vq. The first excursion starts from level nxo 
and stops once it exits the interval (0,nxi), where xi max('Uo, '^i). Conditioned on 
the first excursion not hitting zero, define X2 = Xi -\- Xx^\ Now the second one 
starts at nxi and stops once it exits the interval (0,nx2). Recursively, if the first k 
excursions have not touched zero, the + l)st excursion starts at nxk-i and stops 
when it exits (0,nx/c). If it does not exit through zero, define x/e+i = x^ -\- Tx^\ 
This continues until we find an excursion that hits zero. By induction, every finite- 
dimensional distribution, starting from 1, in this sequence converges in law thanks 



to Theorem [26] The fact that we eventually stop is guaranteed by the fact that the 
forest corresponding to the JCCP is critical. 

The rescaled measure on the street process of each individual excursion con- 
verges. A somewhat tedious argument but vey similar to the ones above show a 
joint convergence. □ 

Note that state space of a street is A/m+xr+ x x which is a complete 
separable metric space and hence allows a regular conditional probability. Now, for 
any two levels ao < ai, the conditional distribution of Street (ai) given Street (ao) 
can be calculated any initial parameter (vq^vi) by the Markov property that the 
limiting street derives from the finite trees. Hence one can define unambiguously 
the following transition probability. 



Definition 21. For any pair of levels ai,ao > 0, we define i3(ao,ai) to be the 
regular conditional distribution of Street (ai) given Street (ao). 
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Figure 11. Path decomposition of the reduced JCCP 



To complete the description of the process of streets we wih need a canonical 
entrance distribution. This definition is standard for Markov processes. A family 
of cr-finite measures {/i(a), a > 0} will be called an entrance distribution for the 
semigroup •) if / i3(ao, ai)(i/i(ao) = djii{ai) for every pair of levels ao < ai. 
However, instead of getting into technical details regarding Markov processes and 
semigroups we simply define a canonical distribution for the initial jumps (Vb, T^i) 
for the finite reduced JCCP and take a limit. 

Unfortunately this entrance distribution cannot be obtained directly by consid- 
ering a sapling with exactly two leaves (the F-tree) in Figure [sj The reason being 
that the one leaf subtrees are not large enough to be significant under our scaling. 
Heuristically, one of the subtrees containing these leaves will die too soon, and, 
although the other subtree might remain large, we are left with a trivial point pro- 
cess. Hence the entrance law has to be guessed from the excursions of the reduced 
JCCP itself. 

The solution to this problem follows from a path decomposition of the final image 
in Figure 10 which we have described in Figure [TT] The top image in Figure 11 is 



identical to the final image in Figure 10 Notice, by independent increment and the 
strong Markov property of the compound Poisson process, the reduced JCCP can 
be seen as being constituted of two paths of the JCCP (not-reduced) interwoven 
together, until one of the paths hit zero. 

More specifically, we start with two paths of the JCCP that start with a jump 
at time zero and stopped when they return to zero. We call the path on the left of 



the second row of Figure 11 to be the 0-JCCP (since it starts with a jump of size 



Vb), and we call the other one to be the 1-JCCP (since its starts with jump size 
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Vi). The way they are interwoven is clear: we start with the current supremum 
of the 0-JCCP and observe the 1-JCCP until it exceeds this level and note its end 
level. Then we start the 0-JCCP until it exceeds the previous level and so on. At 
any moment if any of the two JCCP's hit zero we stop both the processes. In our 
given image, the 0-JCCP hits zero first, and hence we draw the rest of the path for 
the 1-JCCP in dots to show that it does not exist in the original reduced JCCP. 
There is only one way of defining a natural excursion of the and 1 JCCP. 



Consider the compound Poisson process (with drift) X as defined in (16) starting 
at zero. Let It = inf{Xs, < s < t} denote the running infimum. Consider the 
process reflected at this running infimum: Xt — It, for t > 0. Then, this naturally 
decomposes the paths of X in a sequence of excursions whose paths are similar to 
paths of JCCP and 1. Another way to describe this excursion is start with a 
jump of size /q + Jo as given in Lemma [2l] and then follow the process X until we 
return to zero. 

Thus the canonical choice of Vq^Vi is a pair of iid samples with a distribution 
function 

P{Vo >x) = P{Vi >x) = Po(/o + Jo > 
= xL{x)^G{x), x>0. 

Note that 



lim nP{Vo>nvo,Vi>nvi)= (2-^^^^2-^^A^—L= 

n^oo \ / ./VnV^ 



^/VoVl 2^/2Vo 2yj2Vx 

which gives us the rate function for the initial jumps CK), Vi). 

Lemma 28. For any level a > and any n G consider an excursion of the 
reflected process X — I, starting from zero, conditioned to hit na and stopped once 
it goes below zero. 

(i) Consider the joint distribution of the point process of undershoots below 
level na, indexed in the reverse order that they appear (from right to left), 
without counting the final undershoot, Jn{o) (the first undershoot in the for- 
ward order). Then, as n tends to infinity, the distribution of {Yin^Jnio)), 
suitably rescaled, converges to a probability measure 7r(a) on A/m+m+ x R+. 
In particular, suppose (n(a), J(a)) ~ tt^ then n(a) and J{a) are indepen- 
dent. n(a) is compactly supported with length Exponential with mean 

X(a) = — ^^\fa. 

Moreover n(a) is a Poisson point on (0,A(a)) x R+ with a rate LEB x 



g*{v)dv, where g* was defined in (25) 



(ii) Consider Yq and Yi to be two iid samples from the reflected process. Define 
the following events 

Eo{a) := {suplo > na — Jn(a;0), supYi > na} , 

Ei{a) := {sup Yi > na — Jn{ci] 1), supYb > • 

where Jnia; i) is the first undershoot below level na for the process Yi. 
Then, the following limit exists: 

(28) cr(a) := lim nP {Eo{a) U Ei{a)) . 
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Proof. The proof of the lemma fohows from very similar calculations done before. 
The length of the limiting point process follows since, starting at na, the process X 
has 1/W{na) chance of hitting zero before na again. Thus, after the first upcrossing 
of na by X, there are Geometric(l/VK(na)) many upcrossings of level na, each 
of density described in Lemma [22] The Geometric distribution converges to 
Exponential with the stated mean and the density converges to □ 

Theorem 29. For any level a, let Street{a) = {u^I^R) denote a (possibly empty) 
street. The following serves as an entrance distribution /i(a) for the semigroup 
S){-,-) described above. Under fi{a), the street is non-empty with rate o-{a), where 



a{a) is defined in (28). Let T{a) be the probability measure on streets, conditioned 
to be non-empty. 

(i) Under V{a), the random variable R, has the distribution of J {a) in Lemma 



(ii) Given that v is non-empty, iVs distribution is 7r{a) as given in Lemma 28 



Proof. Now, if the initial parameters {Vq^ Vi) are iid then clear the 0-JCCP and the 
1-JCCP are iid too. Let Mo{s) and Mi{s) denote the running supremum of the 
and 1 JCCP respectively, where Mq and Mi will denote the overall supremum. 
Consider the event, that the reduced JCCP crosses a level a > 0. Now, there 
are three possible cases by which this can happen. 

(i) Both {Mo > a} and {Mi > a}. 

(ii) Either {Mi(oo) > a} and Mi(ra-) < Mq < a, 

(ii') or, same as (ii), with the role of Mq and Mi reversed. 

The three events above are disjoint and their respective asymptotic densities 



(when we replace a by na) can be easily calculated from Lemma 23 

Now, when Mq AMi > a, the point process of undershoots below a level a (except 
the first one) is non-empty. Note that this set of undershoots below level a for the 
reduced JCCP is the union of the the undershoots for the and 1 JCCPs, suitably 
arranged. Since they are all iid, this gives us the limiting distribution of tt. 

In either case, at any level a, the distribution of the street is given by the 
appropriate distributions and we can take limits. As before, the convergence of 
finite dimensional distributions show that the claimed distribution is indeed an 
entrance measure. □ 



7. Dynamics of the mailman 

Finally we arrive at the limiting dynamics of the mailman under the Poissonized 
Markov chain. At this point we do not prove any finite to continuum convergence. 
We simply construct a limiting process with the ingredients derived in the last 
Section, the laws {r(a), a > 0}, and the derived transition kernel S){-,-). 



Construction of a random mailman. We are going to sequentially construct 
an exchangeable sequence of mailmen (X(l), X(2), X(3) . . .). 

Fix an initial level a > 0. Let Streeti(l) = (^i,/i,i^i) be a sample from the 
distribution r(a). Let (Xi(l), (^i(l)) be distributed according to this street. Now, 
given Xi(l), let yi(l) denote the age of the atom at Xi(l). This is merely the mass 
of ui at Xi{l) when ^i(l) = 1, and Ri when ^i(l) = 0. Let ^i(l) be the a-algebra 
generated by (Streeti(l), Xi(l), (^i(l)). 
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Now, given ^i(l), generate Street2(l) according to r(yi(l)), and define X2(l), 
<;^2(1), and 12(1) analogously. Let ^2(1) be the a-algebra generated by all these 
new random elements and ^i(l). By induction, given J-k{^), generate Street/c+i(l) 
according to r(Yfc(l)), and update Let J-'oo(l) = (U/cJ^/c(l)). 

Let the joint law of the family Street/;;(1)), A: = 1, 2, . . .), starting 

at an initial age a, be denoted by 6(a). 

Now, to generate X{2) given ^^00(1)7 start by generating (Xi(2), <^i(l)) as an 
independent sample from Street2(l) := Streeti(l). Let ^i(l) be the enlarged fil- 
tration with the additional information. Now we proceed by induction. Define the 
stopping time 

m(l,2) = min{n > : (X,(l), ^,(1)) ^ (X,(2), ^,(2))} . 

Then, then if {k < m(l,2)}, given the history, generate (X/e+i(2), (^/g+i(2)) from 
Street/e+i(2) := Streetfc+i(l). For the rest of the sequence, given J>n(i,2)(2), the 
shifted family 

((-^m(i,2)+/c(i)7 W(i,2)+/c(i)5Street^(i^2)+/c(i))5 k = 1,2,...), 

is distributed as ©(l^m(i,2)(2)). 

The construction of the entire exchangeable family (X(l), X(2), . . .)(a) now fol- 
lows by an obvious induction. The law of this family on the space of random 
sequences is the address protocol at level a. 

One-step transition of the random mailman. Now suppose we are given a 
pair of levels ao < ai and an instance of the family 

{(X(l),^(l),Street(l)), (X(2), ^(2), Street(2)), . . .} (ao). 

Let Q{ao) be the cr- algebra generated by the entire above family. We now describe 
how to generate an iid sequence of mailmen at level ai, given Q{ao). 

To keep our notation simple, we will denote by X', <;^', Street' to denote the 
corresponding quantities at level ai. We will enlarge the <j-algebras naturally as we 
go. Recall the transition kernel io(ao, ai) from the last section. The law of Street ^ (1) 
given the entire history is equal to the law of the Street '^(1) given Street i(l) which 
is given by io(ao,ai). Call Streeti(l) to be the parent of Street2(l). 

Let (X{(1), <^{(1)) be distributed according to this street, and let Y({1) denote 
of -^{(1). Now, there are two cases to consider: 

Case (i) if 1^/(1) < ai — ao, then the subtree corresponding to this atom did not 
exist at level ao- Hence, we generate Street2(l) as a sample from r(y/(l)). 
Generate the mailman as an independent sample from this street. 

Case (ii) if 17(1) > ai — ao, then, by the restriction consistency property of the 
family {r(a), a > 0}, there is a unique atom on Street'i(l) such that the 
age of that atom is exactly 1^(1) — ai + ao > 0. Now since this atom has 
positive probability, there exists a no such that Yi{no) is equal to this age. 
The distribution of Street2(l), given the entire history, is taken to be the 
distribution of Street2(l) given Street2(no), which is a sample distributed 
according to S^{Yi{no), ai). Generate the mailman as an independent sam- 
ple from this street. 

We call Street2(no) to be the parent of Street2(l). Note that this defi- 
nition does not change if rii is another index such that Yi{no) = ^1(711). 
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For the induction step, suppose Street';. (1) has been generated and ^^'(1) is the 
age of the mailman (X^(l), Then, we consider cases (i) and (ii) as above. 

For case (i). Street '^.^^(l) is generated according to r(Y^'(l)). 

For case (ii), suppose that the difference Y^{1) — ai + ag > 0. Then, necessarily, 
the ages of each of . . . , X^(l) is greater than ai — ag. Since, if ^^^(1) has age 

less than ai — ao, then so does X^'^^(l) for all k eN. 

Then, every street Street'i(l), Street2(l), . . . , Street';.(l) has a corresponding par- 
ent at level ao- And, in fact, almost surely there exists such that each of 
Streeti(n/e), Street2(n/e), . . . , Street ^(n/e) is a parent of the corresponding streets at 
level ai and Yk{nk) = ^/^(l) — cli + a^. The distribution of Street^^i(l), given 
the history, is identical to the law of Street^^]^(l) given Streetjt+i(njt) and is given 
by S){Yj.{nk)^ai). The mailman is an independent sample from this street. By 
induction this generates the process <^'(1), Street' (1) given the history so far. 

Now the reader can guess how to generate X'(2), <;-'(2), Street' (2). If we hit an age 
of an atom that has already appeared in the next street is kept unchanged, 

and the mailman is an independent sample from that street. If we hit an atom that 
has not appeared in -^'(1) we proceed by case (i) or (ii) as described above. And 
so on for the rest of {X\ Street'). 

This defines a joint distribution of countable collection of random sequences at 
levels ao and ai . Suppose these random sequences are almost surely in , they are 
the set representations of a pair of continuum trees which has evolved according to 
the dynamics of the Poissonized Aldous Markov process. 

Unfortunately, although it seems very intuitive that these are in L^, we cannot 
come up immediately with a quick argument. Hence we leave this as a conjecture 
to be settled in a follow up paper. 

Concluding remark 

The invariant distribution for the finite Aldous Markov chain is the uniform 
distribution on binary trees. Hence it is intuitive that the Brownian CRT is the 
invariant distribution for the limiting diffusion on tree. However, perhaps the most 
unsatisfying aspect of this article that the author cannot see where the CRT is hid- 
den. There are several tell-tale hints, especially in the appearance of the Stable(l/2) 
subordinator. However, a clear connection with the CRT is missing at this point. 
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